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FOREWORD 

The  Operations  Research  Center  at  the  Massachusetts  Insti- 
tute of  Technology  is  an  interdepartmental  activity  devoted  to 
graduate  education  and  research  in  the  field  of  operations  re- 
search. The  work  of  the  Center  is  supported,  in  part,  by  govern- 
ment contracts  and  grants.  The  work  reported  herein  was  supported 
(in  part)  by  the  U.S.  Army  Research  Office  under  Contract 
DAAG29-76-C-0064. 

The  author  would  also  like  to  acknowledge  the  support  of 
the  National  Science  Foundation  under  Grant  MCS  77-07327. 
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ABSTRACT 


An  algorithm  is  presented  for  the  two  duty  period  schedul- 
ing problem.  This  integer  programming  problem  has  a binary 
constraint  matrix  with  two  sets  of  consecutive  ones  in  each 
column.  At  each  subproblem  of  a branch  and  bound  procedure, 
subgradient  optimization  is  used  to  maximize  the  value  of  a 
Lagrangean  relaxation,  which  is  a network  flow  problem.  The 
algorithm  is  implemented  for  the  two  duty  period  set  parti- 
tioning problem,  i/ith  shortest  path  relaxations.  A second 
algorithm  utilizing  the  unique  properties  of  prime  numbers  is 
developed  for  solving  small  subproblems.  Computational  results 
are  reported  for  several  large  problems., 
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Lagrangean  Relaxation  and  the  Two  Duty 
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CHAPTER  1 


INTRODUCTION 


In  recent  years  we  have  witnessed  considerable 
advances  in  mathematical  programming.  One  productive 
area  of  concentration  has  concerned  the  use  of 
Lagrangean  relaxation  coupled  with  creative  problem 
formulation.  Manifestations  have  included  Held  and 
Karp's  (1970,1971)  approach  to  the  traveling  salesman 
problem,  Fisher's  (1972,1973)  study  of  resource 
constrained  machine  sequencing,  and  applications  to 
multicommodity  flow  problems  [ Assad  (1576) , Kennington 
and  Shalaby  (1977)  ]. 

The  notion  of  subgradients  has  also  attracted  a 
great  deal  of  research.  Held  and  Karp  (1970,1971)  and 
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Held,  Rolfe  and  Crowder  (1974)  demonstrate  its 
application  to  the  traveling  salesman  problem  and  the 
Bulticoanodity  maximum  flow  problem.  Assad  (1976)  and 
Kennington  and  Shalaby  (1977)  have  used  subgradient 
optimization  for  the  minimal  cost  multicommodity  flow 
problem.  Etcheberry  (1976)  has  utilized  the  technique 
in  a new  algorithm  for  set  partitioning  and  set  covering 
problems.  Corpuejols,  Fisher  and  Nemhauser  (1977)  have 
used  this  solution  strategy  to  solve  facility  location 
models. 

Another  advance  in  mathematical  programming  has 
been  the  use  of  group  theoretic  and  other  knapsack 
equivalents  of  arbitrary  integer  programs.  Fisher, 
Northup  and  Shapiro  (1975)  report  computational 
experience  with  the  group  theoretic  approach.  Garfinkel 
and  Nemhauser  (1972)  give  an  alternative  knapsack 
equivalent  of  an  integer  program.  Glover  (1967) , 

Shapiro  and  Wagner  (1967)  and  Shapiro  (1968)  offer 
methods  for  solving  the  knapsack  problem  in  a dynamic 
programming  - shortest  path  context. 

Another  fruitful  area  for  recent  research  has  been 
the  combination  of  algorithmic  capabilities  of 
operations  research  and  computer  science.  The  use  of 
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efficient  list  processing  techniques  from  computer 
science  has  in  many  cases  led  to  order  of  magnitude 
improvements  in  computational  efficiency.  Magnanti 
(1976)  and  Magnanti  and  Golden  (1978)  survey  many  recent 
instances. 

In  this  thesis  we  embellish  upon  all  of  these 
developments  in  the  context  of  certain  personnel 
scheduling  problems.  Our  research  has  extended  into 
two  main  areas; 

1)  the  study  of  efficient  dynamic  programming  - 
shortest  path  solution  techniques  for  certain  integer 
programs.  Part  of  this  investigation  has  led  to  a new 
method  for  transforming  integer  programs  into  equivalent 
knapsack  problems.  Computational  experience  has  shown 
the  effectiveness  of  these  techniques  for  small  set 
partitioning  problems. 

2)  the  use  of  Lagrangean  relaxation  and  subgradient 
optimization  for  a certain  class  of  personnel  scheduling 
problems.  The  two  duty  period  set  partitioning  problem 
we  study  has  a natural  shortest  path  relaxation.  For 
each  subproblem  in  a branch  and  bound  strategy  we  use  an 
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iterative  backward-forward  reaching  shortest  path 
algorithm  and  subgradient  optimization  for  computing 
strong  bounds.  This  approach  is  an  outgrowth  of  an 
observation  by  Veinott  and  Wagner  (1962)  that  certain 
very  specialized  scheduling  problems  can  be  solved 
directly  as  shortest  paths. 

The  remainder  of  this  introductory  chapter 
outlines  the  problems  to  be  studied  and  their 
applications.  We  first  summarize  the  methodology  of  our 
knapsack  transformation  and  its  use  within  a dynamic 
programming  - shortest  path  framework  and  then 
introduce  the  personnel  scheduling  problem.  He  conclude 
the  chapter  with  a more  detailed  summary  of  the 
remainder  of  this  work. 


1.  1 Knapsack  Equivalents  of  Integer  Programs 

Consider  any  integer  programming  problem  (IP)  (with 
St  meaning  subject  to) 
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(IP)  Min  CX 

St  £ A .X  . = b 

j = 1 ' ' 

X in  S 

M 

where  and  b are  integer  K-vectors  in  Z and  where 

N 

S,  a given  subset  of  Z , captures  the  discrete  nature 

characteristics  of  the  problem,  e.g.  S might  be  the  set 

z”  of  nonnegative  integer  N-vectors,  the  set  of  0,1 

integer  vectors,  or  some  other  more  complicated  set. 

Given  a one-to-one  linear  mapping  P from  the  integer 
M 

M-vectors  Z into  the  real  numbers  R,  (IP)  may  be 
rewritten  as  an  equivalent  problem; 

(KIP)  Kin  CX 

N 

St  E P(^  ) X.  = P(b) 
j = l 3 3 

X in  S 

In  Chapter  5 we  study  the  knapsack  problem  (KIP) 
defined  by  a mapping  P which  utilizes  the  unique 
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properties  of  the  prime  numbers.  Given  any  distinct 


primes  P^,P2#.  • • the  mapping  P defined  by 

P:  Z**  ->  R 

M 

P(Z)  = ]C 
i=l 

is  a one-to-one  linear  mapping  of  the  integer 
M-vectors  into  the  real  numbers.  (KIP),  derived  with 
this  mapping  P,  is  a knapsack  problem  with  irrational 
constraint  coefficients.  If  the  variables  are  bounded 
and  the  irrational  coefficients  P{Aj),P(b)  are  closely 
enough  approximated  by  rational  numbers  At,b*,  the 
resulting  problem  is  equivalent  to  (KIP)  and  (IP)  (see 
Proposition  5.1,  Section  5.3  for  a proof). 

A common  solution  strategy  for  solving  these 

knapsack  equivalents  of  integer  programs  is  dynamic 

programming  or  shortest  paths.  A useful  property  for 

this  algorithmic  approach  is  the  property  of  optimal 

subsolutions.  For  integer  programming  problems  (IP) 

M 

with  S = Z^,  any  partial  solution  X of  an  optimal 
solution  X*  (i.e.  4 Xt) , X is  optimal  for  the 

resources  it  uses.  That  is,  X is  optimal  in 


20 


Hin 


CX 


N 

St  23  A.X. 
j = l ^ ^ 


N 

E AjX. 
j = l J ^ 


X in 


This  property  is  very  useful  when  one  considers  the 
knapsack  reformulation  of  (IP) . The  implication  is, 
when  using  dynamic  programming  - shortest  path 
techniques  to  solve  (KIP) , one  may  impose  an  arbitrary 
ordering  upon  the  variables  of  the  solution.  Rather 
than  consider  every  variable  as  an  arc  at  every  node  of 
the  shortest  path  interpretation  of  (KIP) , we  need 
consider  only  a subset  of  the  variables  as  arcs  at 
each  node. 


This  property  of  optimal  subsolutions  depends  upon 
the  problem  having  unbounded  variables.  However,  for 
some  0,1  integer  problems  the  upper  bounds  are  enforced 
naturally  by  the  constraints  (without  explicit  upper 
bounds  on  variables)  or  by  the  objective  function.  The 
set  partitioning  problem  is  a case  where  the  constraints 
force  each  variable  to  be  between  zero  and  one. 

Multiple  choice  constraints  also  have  this  property. 
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The  set  covering  problem  with  positive  cost  coefficients 
is  an  example  where  the  objective  function  forces  the 
0,1  condition. 


In  Chapter  5 we  exploit  the  property  of  optimal 
subsolutions  in  our  development  of  algorithms  for  the  set 
partitioning  and  set  covering  problems.  We  order  the 
variables  according  to  the  first  row  with  a nonzero 
entry.  Consequently,  for  the  set  partitioning  problem, 
at  a given  node  of  the  shortest  path  problem  we  need 
consider  only  those  variables  whose  first  entry  is  in 
the  first  constraint  not  yet  satisfied.  For  the  set 
covering  problem  we  need  consider  only  those  variables 
with  an  entry  in  the  first  constraint  not  yet  satisfied. 

These  ideas  have  a broader  application.  In  the 
case  of  a general  integer  programming  problem  (IP)  with 
unbounded  variables  and  a sparse  coefficient  matrix, 
there  also  is  a good  ordering  of  the  variables.  At  each 
node  of  the  shortest  path  interpretation  of  the  knapsack 
formulation  of  (IP)  one  need  consider  only  those 
variables  which  will  contribute  to  satisfying  the  first 
constraint  not  satisfied  at  that  node.  In  this  way,  at 
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each  node  one  need  consider  only  a small  subset  of  the 
arcs  leaving  that  node. 


In  this  paper  we  will  apply  these  ideas  to  a 
particular  problem  - the  two  duty  period  set 
partitioning  scheduling  problem,  which  is  a special  case 
of  the  class  of  K duty  period  scheduling  problems.  i:e 
will  first  use  duality  to  develop  a relaxation  for  the 
two  duty  period  scheduling  problem.  This  Lagrangean 
relaxation  is  an  acyclic  shortest  path  problem.  We 
suggest  an  iterative  backward-f orvrard  reaching  algorithm 
for  solving  the  relaxation.  Subgradient  optimization  is 
used  to  maximize  the  Lagrangean.  The  entire  process  is 
embedded  in  a branch  and  bound  strategy  for  solving  the 
scheduling  problem  optimally. 

For  subproblers  with  dimensions  of  manageable  size, 
we  reformulate  the  subproblem  as  a knapsack  problem 
using  the  prime  number  mapping  P.  Because  the  original 
subproblem  is  a set  partitioning  problem  and  has  the 
property  of  optimal  subsolutions,  the  knapsack 
formulation  can  be  solved  efficiently  using  a 
predetermined  ordering  of  the  variables,  as  mentioned 
earlier. 
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The  computational  results  reported  in  Chapter  7 
show  that  this  solution  strategy  for  the  two  duty  period 
scheduling  problem  has  led  to  significant  improvements 
in  solution  times  over  the  best  existing  algorithms  for 
this  class  of  problems. 

1.2  The  Two  Duty  Period  Scheduling  Problem 

Personnel  scheduling  problems  frequently  give  rise 
to  integer  programming  problems.  We  now  introduce  the 
two  duty  period  scheduling  problem,  an  integer 
programming  problem  with  special  structure.  A special 
case  of  the  two  duty  period  scheduling  problem  arises 
when  the  problem  is  a set  partitioning  problem.  This 
special  case  will  be  called  the  Helsinki  problem. 


1.2.1  Formulation  of  the  Two  Duty  Period  Scheduling 
Problem 


t 

A two  duty  period  scheduling  problem  may  be  defined  | 
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(P) 

Hin 

cx 

St 

AX  ^ 

b 

X 

Integer  and  Honnegative 

where  b is  an  M X 1 vector,  C is  a 1 X N vector,  X is  an 
N X 1 vector  and  where  the  entries,  a^j/  of  the  M X N 
constraint  matrix  A are  either  zero  or  one  and  each 
column  of  A has  at  most  two  segments  of  ones.  A SEGMENT 
of  ones  is  a consecutive  set  of  column  elements 
i = k,k+ 1, . . . 1,k+p  such  that 


II 

*n 

(0 

1 

1 ” k/k^1,... 

,k+p-1 ,k+p 

or  a ■ . = 

1 3 

-1 

i = k,k+1,... 

, k+p- 1, k+p 

0 

if  k > 1 

®k+p+l,j 

0 

if  k+p  < M 

The  symbol  indicates  that  the  constraints  may 
be  equalities  or  oither  less  than  or  greater  than 
inequalities. 

A special  case  of  the  two  duty  period  scheduling 
problem  which  will  receive  much  attention  in  this  work 
is  the  Helsinki  problem  or  two  duty  period  set 
partitioning  problem  (HP) 
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where  each  column  of  A contains  at  most  two  segments  of 
ones,  the  rest  of  the  entries  being  zero.  The  right 
hand  side  elements  are  all  ones.  The  notation  X = 0, 1 
indicates  that  each  entry  of  the  vector  X is  either  0 or  1 

1.2.2  Interpretation  and  Applications  of  the  Two 
Duty  Period  Scheduling  Problem 

The  two  duty  period  scheduling  problem  arises  very 
naturally  in  personnel  scheduling.  Consider  a situation 
where  there  are  a number  of  jobs  each  of  which  must  be 
done  by  a single  person  for  a given  period  of  time.  As 
an  illustrative  example  we  consider  buses  and  drivers. 

It  was  in  this  form  that  the  problem  was  first  brought 
to  the  author's  attention  by  Harkku  Tamminen  of  the 
Helsinki  Data  Center,  Finland  [Koljonen  & Tamminen 
(1977)  ].  Consider  Figure  1.1  where  rows  1 and  2 
correspond  to  the  two  hours  of  operation  of  Bus  A and 
rows  3 through  8 correspond  to  six  hours  of  continuous 
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An  Example  of  the  Two  Duty  Period  Set  Partitioning  Problem 

The  Helsinki  Problem 


Figure  1 . 1 
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operation  of  Pus  B.  Each  coluian  corresponds  to  a 
possible  driver  schedule  where  an  entry  of  1 in  the 
matrix  indicates  that  the  driver  of  that  column  drives 
the  bus  of  that  row  for  the  hour  corresponding  to  that 
row.  With  the  restriction  that  a driver  drive  no  more 
than  one  bus  during  his  morning  duty  period  and  no  more 
than  one  (possibly  different)  bus  during  his  afternoon 
duty  period,  each  column  will  contain  at  most  two 
segments.  The  problem  is  to  find  a set  of  minimal  cost 
of  driver  schedules  such  that  each  bus  will  have  exactly 
one  driver  during  each  of  its  hours  of  operation. 

Other  cases  of  two  duty  period  scheduling  problems 
which  have  been  cited  in  the  literature  include  cyclic 
staffing  with  overtime  [Bartholdi,  Orlin  G Batliff 
(1977)  ] and  days  off  scheduling  [Brownell  G Lowerre 
(1976),  Tibrewala,  Philippe  G Browne  (1972),  Bartholdi, 
Orlin  G Batliff  (1977) , Bartholdi  G Batliff  (1977) , 

Orlin  (1977)  ].  Glover  and  Mulvey  (1976)  refer  to 
implementations  of  project  partitioning,  job  processing, 
and  monitoring  and  maintenance  scheduling  problems. 

Other  authors  who  have  studied  very  similar  problems 
include  Baker  (1974,  1975) , Maier-Bothe  (1973) , and 


A special  case  of  the  two  duty  period  scheduling 


problem  is  the  circular  ones  and  cyclic  staffing 
problem.  This  is  a problem  of  scheduling  personnel 
working  continuous  duty  periods  in  cyclic  time.  See 
Figure  1.2.  Each  column  represents  a possible  work 
schedule  of  one  continuous  duty  period  in  real  time. 
However,  due  to  the  arbitrary  nature  of  starting  a day 
at  midnight  in  the  matrix  representation,  a duty  period 
beginning  late  one  day  and  finishing  early  the  next 
becomes  two  segments  in  the  corresponding  column. 
Efficient  solution  procedures  are  known  for  certain 
cases  of  this  problem  [Orlin,  Bartholdi  6 Ratliff 
(1977)  ]. 

The  two  duty  period  scheduling  problem  can  be 
generalized  to  a K duty  period  scheduling  problem  where 
each  column  of  the  constraint  matrix  contains  at  most  K 
segments.  For  example,  the  problem  of  scheduling 
personnel  who  work  five  continuous  duty  periods  in  a 
week  might  be  formulated  as  a five  duty  period 
scheduling  problem  in  linear  time  or  as  a six  duty 
period  scheduling  problem  in  cyclic  time. 
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An  Example  of  the  Circular  Ones  - Cyclical  Staffing  Problem 


Figure  1.2 
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1.2.3 


Solution  Techniques 


The  simplest  of  this  class  of  problems,  the  one 
duty  period  scheduling  problem,  is  known  to  have  a 
unimodular  matrix  [Veinott  & Wagner  (1962),  Garfinkel  & 
Nemhauser  (1972)  ].  In  Chapter  2 we  will  demonstrate  how 
it  may  be  easily  reformulated  as  a network  flow  problem. 

The  general  two  duty  period  scheduling  problem  (P) 
is  a general  integer  programiaing  problem.  As  such  it 
can  be  solved  by  a number  of  techniques  - cutting  plane, 
enumeration,  or  group  theoretic.  However,  no  good 
specialized  algorithms  have  been  developed  which  might 
solve  the  two  duty  period  problem  in  better  times  than 
might  be  expected  from  a general  integer  programming 
code. 

1.3  Organization  of  the  Paper 

1.3.1  An  Algorithm  for  the  Helsinki  Problem 

In  Chapter  2 we  will  indicate  how  we  can  take 
advantage  of  the  special  structure  of  the  two  duty 
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period  scheduling  problem  to  develop  an  algorithm  for 
solving  it  efficiently.  In  Chapter  3 ve  will  develop 
this  algorithm  in  great  detail  for  the  special  case,  the 
Helsinki  problem.  He  will  develop  a branch  and  bound 
enumeration  technique,  in  which  the  relaxation  solved 
for  each  subproblem  will  be  a shortest  path  problem. 

In  Chapter  4 we  introduce  the  Lagrangean  relaxation 
of  the  Helsinki  problem  in  order  to  get  tighter  bounds 
for  our  branch  and  bound  procedure.  To  evaluate  the 
Lagrangean,  ve  need  only  solve  a shortest  path  problem. 
When  subgradient  optimization  is  used  to  maximize  the 
Lagrangean,  the  extra  computational  burden  is  more  than 
offset  by  the  reduction  in  the  number  of  subproblems 
generated. 


1.3.2  Prime  Numbers  and  the  HELSINKI  Algorithm 

In  Chapter  5 we  demonstrate  how  prime  numbers  can 
be  used  to  reformulate  any  integer  programming  problem 
as  a knapsack  problem.  Interpreting  this  knapsack  is 
problem  as  a shortest  path  problem,  leads  to  a very 
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efficient  solution  procedure  whenever  the  original 
integer  programming  problem  is  a set  partitioning 
problem.  The  existence  of  this  efficient  solution 
technique  depends  on  the  use  of  logarithms  of  primes  as 
multipliers  in  obtaining  a surrogate  constraint  for  the 
original  constraints.  A somewhat  less  efficient 
algorithm  is  developed  for  solving  the  resulting 
shortest  path  problem  when  the  original  problem  is  a set 
covering  problem. 

The  prime  number  - shortest  path  method  is  severely 
limited  in  the  size  of  problem  it  can  handle.  However, 
we  have  incorporated  it  into  the  HELSINKI  algorithm  to 
be  used  when  the  subproblem  to  be  solved  is  of  small 
enough  dimensions.  Used  in  this  way,  the  prime  number  - 
shortest  path  technique  is  guaranteed  to  fathom  a node 
when  used.  Ke  found  it  to  be  very  effective  in 
eliminating  the  need  to  separate  nodes  at  the  lower 
levels  of  the  tree  search. 

1.3.3  Side  Constraints 

In  Chapters  5 and  7 we  develop  two  procedures  by 
which  side  constraints  of  the  form 
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t .X  . < T t . < 0 

J D “ J 

can  be  incorporated  into  the  HELSINKI  algorithm 
Computational  experience  with  the  constraint 

N 

E i ' 

j = l J 


limiting  the  number  of  variables  in  the  solution, 
indicates  that  the  side  constraints  tend  to  slow  the 
algorithm  slightly.  However,  in  some  cases, 
particularly  where  the  constraint  cannot  be  easily 
satisfied,  its  effect  is  to  significantly  increase  the 
speed  of  the  algorithm. 


1.3.4  Extensions,  Discussion  and  Computational 
Experience 

Chapter  6 generalizes  the  results  of  the  previous 
chapters  to  the  general  two  duty  period  scheduling 
problem  (P)  . We  show  that  the  relaxation  is  a network 
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flow  problem.  Again,  an  onumerative  procedure  can  be 
used  to  solve  the  general  two  duty  period  scheduling 
problem.  Subgradient  optimization  is  used  to  maximize 
the  Lagrangean  relaxation  of  (P)  at  each  subproblem. 

Our  computational  experience  with  the  HELSINKI 
algorithm  is  presented  in  Chapter  7.  Our  algorithm  has 
been  tested  on  four  real  world  probles  of  average  size 
65  X 475  (as  well  as  on  a number  of  small  test  problems 
artificially  generated) . Since  the  Helsinki  problem  is 
a set  partitioning  problem,  specialized  algorithms  have 
been  developed  for  solving  it  quite  efficiently. 

However,  the  HELSINKI  algorithm  consistently 
outperformed  Harsten's  SETPAK  set  partitioning  algorithm 
[ Marsten  (1971)],  generally  regarded  as  the  best 
available  set  partitioning  algorithm.  Tests  done  by 
Tamminen  on  one  test  problem  with  a number  of  set 
partitioning  algorithms  (including  the  Garfinkel  6 
Nemhauser  algorithm)  led  that  investigator  to  conclude 
that  SETPAR  outperformed  all  others  (the  HELSINKI 
algorithm  was  not  developed  at  that  time)  [ Kol jonen  6 
Tamminen  (1977)  ]. 


Chapter  8 deals  with  a special  case  of  the  two  duty 
period  scheduling  problem,  the  circular  ones  problem. 
This  is  a scheduling  problem  in  cyclic  time. 

The  results  of  the  earlier  chapters  are  generalized 
for  the  K duty  period  scheduling  problem  in  Chapter  9. 
This  K duty  period  problem  corresponds  to  an  integer 
program  with  at  most  K segments  of  ones  in  each  column, 
other  entries  being  zero.  He  show  that  the  same 
theoretical  development  applies. 

Finally,  in  the  last  chapter,  we  examine  the 
general  methodology  used  in  developing  the  HELSINKI 
algorithm.  The  idea  of  decoupling  columns,  which  is 
used  to  reformulate  the  Helsinki  problem  as  a shortest 
path  problem  with  side  constraints,  is  shown  to  be 
applicable  to  near  block  diagonal  matrices.  There  also 
exists  a corresponding  concept  of  decoupling  rows. 
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CHAPTER  2 


r 


SPECIAL  STRUCTURE  OF  THE  TWO  DUTY  PERIOD  SCHEDULING 

PROBLEM 

In  this  chapter  we  will  develop  a metnodology  for 
solving  the  two  duty  period  scheduling  problem.  He  will 
consider  first  the  one  duty  period  scheduling  problem 
and  show  how  it  may  be  transformed  to  a network  flow 
problem.  Then  in  Sections  2.ii  and  2.5  we  will  show  how 
the  two  duty  period  scheduling  problem  may  be 
reformulated  as  a one  duty  period  problem  with  side 
constraints.  Equivalently r the  two  duty  period  problem 
nay  be  reformulated  as  a network  flow  problem  with  side 
constraints. 
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The  One  Duty  Period  Set  Partitioning  Problem 


The  one  duty  period  set  partitioning  problem  may  be 


defined  as 


(1HP)  Min  CX 

St  AX  = 1 

X = 0,1 


where  the  entries  of  the  M X S constraint  matrix  A are 
either  zero  or  one  and  each  column  A^,  j = 
contains  at  most  one  segment  of  ones.  A SEGMENT  S of  A. 
is  defined  to  be  a subset  of  rows,  S = {p, p+ 1 / . . . / p+k} , 
such  that  a^j  = 1 for  all  i in  S (or  a^j  = -1  for  all  i 
in  S)  and  a^  , . = 0 if  p / 1 and  a„..  . , . = 0 if  p+k  f 

p- X I 3 P'rA^TX/J 


An  example  is  given  in  Figure  2. 1. 
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An  Example  of  the  One  Duty  Period  Set  Partitioning  Problem  (IHP) 


Figure  2.1 
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2.2 


Transforming  the  One  Duty  Period  Set  Partitioning 
Problem  to  a Shortest  Path  Problem 


The  one  duty  period  set  partitioning  problem  is 
known  to  have  a unimodular  constraint  matrix  A 
[Garfinkel  & Nemhauser  (1972)/  Veinott  6 Wagner  (1962)  ]. 
It  follows  that  the  one  duty  period  set  partitioning 
problem  can  be  represented  as  a shortest  path  problem 
with  (M+1)  nodes  and  N arcs.  The  set  partitioning 
problem  is  to  choose  a set  of  columns  that  together 
cover  every  row  exactly  once.  Consider  a segment  S 
covering  rows  p through  p+k.  This  may  be  represented  by 
an  arc  originating  at  node  p and  terminating  at  node 
p+k+1.  See  Figure  2.2.  Then  each  feasible  solution  to 
the  one  duty  period  set  partitioning  problem  will 
correspond  to  a path  from  node  1 to  node  M+1. 

Each  arc  of  this  shortest  path  interpretation  of 
the  one  duty  period  set  partitioning  problem  is  directed 
in  the  sense  of  originating  at  the  lowest  row  number  in 
the  segment  to  which  it  corresponds.  Consequently,  the 
graph  is  acyclic. 
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Consider  the  folloving  transfornations  on  the  H X N 
constraint  aatrix  A of  (1HP).  First  append  the  trivial 
constraint 


0 = 0 

giving  the  (M+1)  X N constraint  matrix  [A'|b].  Let 

T = (t,^  ) 

be  the  (M  + 1)  X (M'H)  matrix  such  that 

= -1,  i = 

t^^  = Of  otherwise 

See  Figure  2.3.  T simply  subtracts  from  each  row  of  A* 
(except  the  first)  the  preceding  row.  Since  T is 
nonsingular  the  constraint  set  (TA»|Tt]  is  equivalent  to 
[A|1].  The  effect  of  the  transformation  has  been  to 
create  an  equivalent  problem  (1HP*) 
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(1HP*)  Min  CX 

St  TA'X  = T1 

X Integer,  Nonnegative 

where  each  column  of  (TA')  contains  exactly  one  1 and 
one  -1,  the  rest  of  the  entries  being  zero.  See  Figure 
2.4.  In  other  words  TA'X  = T1  represents  a shortest 
path  problem. 


2.3  Transforming  the  General  One  Duty  Period  Scheduling 

Problem  to  a Network  Flow  Problem  ' | 

Consider  now  the  more  general  one  duty  period 
scheduling  problem. 

(IP)  Min  CX 

St  AX  > b 

X Integer,  Nonnegative 

where,  once  again,  each  column  of  A contains  exactly  one 
segment  of  ones,  the  rest  of  the  entries  being  zero. 
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See  the  example  in  Figure  2.5.  Then  (IP)  can  be 
transformed  to 


r 


(IP)  Min  CX 

St  AX  - lY  = b 

X Integer,  Nonnegative 

by  the  addition  of  surplus  variables  Y.  See  the  example 
in  Figure  2.6.  Notice  that  (IP)  still  satisfies  the 
criterion  that  each  column  have  exactly  one  segment.  It 
is  also  the  case  for  less  than  or  equal  to  inequalities, 
AX  4 b,  that  the  addition  of  slack  variables  will  lead 
to  a reformulation  satisfying  the  criterion  that  each 
column  have  exactly  one  segment. 

In  the  same  manner  that  (1HP)  was  transformed  to  an 
acyclic  shortest  path  problem,  (IP)  may  be  transformed 
to  a network  flow  problem. 


Consider  the  example  given  in  Figure  2.6.  Adding 
the  trivial  row 
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An  Example  of  the  General  One  Duty  Period  Scheduling  Problem  (IP) 


Figure  2.5 
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gives  the  matrix  in  Figure  2.7.  Then,  after  applying 
the  transformation  T,  one  gets  the  resulting  matrix 
depicted  in  Figure  2.8.  Our  one  duty  period  scheduling 
problem  has  been  transformed  to  a network  flow  problem. 

In  the  one  duty  period  set  partitioning  problem 

(1HP)  one  unit  of  flow  is  to  pass  through,  a path  from 

node  1 to  node  M+1.  In  the  more  general  problem  (IP)  we 

must  have  b^  units  of  flow  passing  through  each  node  i. 

Consequently,  at  each  node  we  may  have  to  introduce  or 

withdraw  units  of  flow.  Precisely,  we  must  introduce 

(or  withdraw)  b^^  - b^_j^  units  of  flow  at  node  i,  where 

bp  = 0.  Now,  we  must  add  a source  (Node  0)  as  well  as  a 

sink  (Node  M + 2) . See  Figure  2.9,  where  arcs 

originating  at  node  0 introduce  flow  to  nodes  and  arcs 

ending  at  node  10  withdraw  flow  from  nodes.  If  the 

constraints  of  (IP)  had  all  been  equations  or  less  than 

or  equal  to  inequalities,  then  the  corresponding  network 

flow  representation  would  have  been  acyclic.  As  it  is, 

each  variable  Y.  becomes  an  arc  from  node  i -«■  1 to 
1 

node  i. 
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Figure  2.7 
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Figure  2.8 


2.4  Reformulating  the  Two  Duty  Period  Set  Partitioning 
Problem  as  a Shortest  Path  Problem  with  Side 
Constraints 


Consider  now  the  two  duty  period  set  partitioning 
problem  which  we  will  call  the  Helsinki  problem  (HP) 

(HP)  Min  CX 

st  AX  = 1 
X = C,  1 

where  each  column  of  A contains  one  or  two  segments  of 
ones,  the  rest  of  the  entries  being  zero.  An  example  is 
given  in  Figure  2.10.  This  example  corresponds  to  the 
bus  driver  example  given  in  Figure  1.1  in  Chapter  1. 

(HP)  and  the  example  in  Figure  2.10  are  both  set 
partitioning  problems.  However,  both  have  the  special 
property  of  having  at  most  two  segments  in  each  column. 

We  know  that  the  one  duty  period  set  partitioning 
problem  is  easily  solved.  What  implications  does  this 
have  for  the  two  duty  period  set  partitioning  problem? 
The  shortest  path  interpretation  of  the  single  segment 
set  partitioning  problem  suggests  a natural  relaxation 
for  the  two  segment  problem. 
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I 


An  Example  of  the  Two  Duty  Period  Set  Partitioning  Problem 

Figure  2.10 


Consider  again  the  Helsinki  problem  example  in 


Figure  2. 10.  Suppose  each  column  containing  two 

1 2 

segments  were  separated  into  two  columns,  A^  and  A^, 
each  containing  one  segment,  see  Figure  2.11.  The 
matrix  in  Figure  2.11  corresponds  exactly  to  that  of 
Figure  2. 1. 

In  terms  of  our  shortest  path  interpretation,  each 

variable  of  the  original  problem  corresponds  to  two 
1 2 

arcs,  Xj  and  X^,  (see  Figure  2.12).  Dividing  the 
columns  decouples  the  arcs  (see  Figure  2.13).  If  we 
impose  the  constraints  that  each  arc  must  have  the  same 
value  as  its  partner  (i.e.  = X^ , j = 1,...,N)  then 

our  new  problem  (HP*)  is  equivalent  to  the  original 
problem.  Proposition  2.1  formalizes  this  idea  in 
Section  2.6. 
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where  j “ 1 1 2 , • . . 1 5 


The  Helsinki  Problem  of  Figure  2.10  with  Decoupled  Columns 


Figure  2.11 
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That  is,  the  Helsinki  problem  is  equivalent  to  an 


acyclic  shortest  path  problem  with  coupled  arcs. 
Similarly,  it  can  be  shown  that  the  more  general  two 
duty  period  scheduling  problem  is  equivalent  to  a 
network  flow  problem  with  coupled  arcs. 


2.5  Reformulating  the  General  Two  Duty  Period 

Scheduling  Problem  as  a Network  Flow  Problem  with 
Side  Constraints 

Consider  now  the  general  two  duty  period  scheduling 
problem  (P)  (see  Figure  2.14) 


(P) 

r 


Min  CX 

St  AX  = b 

X Integer,  Nonnegative 


where  each  column  of  A contains  at  most  two  segments  of 
ones,  other  entries  being  zero.  Inequalities  in  the 
original  constraints  may  be  transformed  to  equations  by 
the  use  of  surplus  and  slack  variables. 


i 

) 

i 
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j = l 

8t  Xj  X2  X^  x^  Xj  Xg  RHS 

1 0 1 1 0 0 «=  3 

0 0 1 1 0 0 -5 

0 10  0 10-4 

0 10  0 11-7 

0 110  0 1-6 
10  110  1-4 
10  0 110-8 
0 10  0 10-4 

Xj  Nonnegative,  Integer  j = 1,2,..., 6 


An  Example  of  the  General  Two  Duty  Period  Scheduling  Problem 


Figure  2.14 
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Replace  each  variable  X.  with  two  variables  ar^d 

D J 

X^.  Partition  the  non-zero  entries  of  the  A matrix  into 
D 


two  matrices  A^ 

and  A 

^ where  A^ 

represents  the  first 

segments  of  the 

variables 

and 

represents  the  second 

segments  (where 

they 

exist) . A 

= A^  + A^. 

Divide  the 

costs  C into 

and  C 

^ where  C = 

♦ C^. 

The  new 

problem  then  is 

given 

by 

(P*)  (see  Figure 

2.  15) 

(P')  Min 

C^X^ 

+ 

C^X^ 

st 

A^X^ 

♦ 

A^X^ 

= b 

X 

^ ,x2 

Integer, 

Nonnegative 

where 

= C 

To  the  constraints  [A^|A^|b],  append  the  trivial 
constraint 


0 = 0 

to  get  a new  constraint  matrix  [ A^ ' j A^ ' 1 h ' Then  apply 
transformation  T to  get  [TA^'  |ta^'  | Ib'  ].  see  Figure 
2.16.  Then  [TA^'  |TA^'  |Tb'  ] can  be  interpreted  as  a 
network  flow  problem.  And  our  reformulation  of  (P) 
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Min  2 Clxi  E Cjxj 

j = l J ^ j-1  ^ ^ 


8t  xj  X?  xi  X?  xl  X?  x}  X?  xi  x|  xi  RHS 

11223344550 

10001010000-3 

00001010000-5 

00100000100-4 

00100000100-7 

00100100001-6 

01000101001-4 

01000001010-8 

00010000010-4 


J - 1,2,.. .,5 


J — 1,2, ...,6 


j — 1,2,. ..,5 


The  Two  Duty  Period  Scheduling  Problem  of  Figure  2.14  with 

Decoupled  Columns 


Figure  2.15 


yl  = x2 
j j 


Xj  Nonnegative,  Integer 
2 

X.  Nonnegative,  Integer 
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St  x}  X?  xj  xl  x\  xl  x}  x}  xj  x|  xi  RHS 

11223344556 

10001010000-3 
-1  0000000000-2 
0 0 1 0 -1  0 -1  0 1 0 0 = -1 

00000000000-3 
00000  100  -1  0 1-  -1 
01  -1  00001000  - -2 

00000  -1  0001  -1  -4 

0 -1  0 1 000  -1  000  - -4 

000  -1  00000  -1  0 - -4 


I 


Nonnegative , 


Integer 


Nonnegative, 


Integer 


J 

j 

J 


1.2..  ...5 

1.2..  ...6 

1,2, ...,5 


The  Two  Duty  Period  Scheduling  Problem  of  Figure  2.14 
Transformed  to  a Network  Flow  Problem 

Figure  2.16 


Min 

St  TA^'X^  + TA^'X^  = Tb* 

IX^  - IX^  = 0 

X^,X^  Integer,  Nonnegative 

can  be  interpreted  as  a network  flow  problem  with  side 
constraints.  The  nature  of  these  side  constraints  is  to 
couple  pairs  of  variables,  forcing  each  pair  to  have  the 
sane  value.  Hence,  the  two  duty  period  scheduling 
problem  is  equivalent  to  a network  flow  problem  with 
coupled  arcs. 

2.6  Decoupling  Columns  in  Mathematical  Programming 
to  Obtain  Equivalent  Problems 

Consider  the  mathematical  programming  problem 

(LP)  Min  CX 

St  AX  = b 

or  equivalently 

t 

t 

i 

f- 
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(LP)  Hin 


N 

Sc 


j 


where  Aj  is  the  jth  column  of  the  HxN  matrix  A«  Suppose 
each  vector  [CjIAj]  were  to  be  replaced  by  vectors 


[Cj  I ] 


A * A 


having  the  property  that 


j 

[C  I A ] = S [cj  I a'J  3,  j 
J J J J 


= 1,...  ,N 


k 

and  simultaneously  X,  is  replaced  by  fX  tfe 

J J j 

can  then  create  a new  problem  (LP*)  with  these  new 

variables  plus  the  new  constraints  , A = 

1 ,Aj-1,  j = 1,...  N. 


(LP*) 


N 


S s cjx'; 

j=l  k=l  ^ ^ 


N 


2 2 A^x^ 


A ? X?  = b 


j-=l  k=l 


_ k „ k+l  « . - • « 

Xj-Xj  -0,  k=l,.-.,Aj-1 

j = 


Proposition  2. 1 


(LP*)  is  equivalent  to  (IP) 


Proof;  Part  1 


let  X = (X2,X2#---fXjj  ) be  a solution  to  (LP) 

let  X^*  = X j = k = 1,...,k 

j j j 


X 

k*  _ 
j 

x!;-.  K 

j = 

N 

■'J 

N '‘j 

N 

S 

S 

‘J 

x*^*  = 
j 

S S aJ  X 

j=l  k=l  -J 

=s 

k=l 

j=l 

N 

*^.1 

N ’'j 

N 

S 

S 

^5 

k* 

S S cj  X. 

j=l  k=l  J J 

j-i 

k=l 

J 

j=l 

X*  = fX^*  X X^*  X^^*)  is 

X (Xj  ,Xj  ,X2  ,...,Xj^  ) is 

a solution  to  (LP*)  wita  value  CX 


Part  2 


1*  ki*  k * 

Let  X*  = (Xj  , . . . ,X  j , . . . , X jjN  ) be  a solution  to  (LP*) 


1*  2* 
then  X j =3^ 


let  Xj  =X^ 


• = Xjj  , j = 1,2,. ..,N 


then 


N 

N 

N ^ 

S c 

= 23 

< 23 

j 

= 23  23 

J=1  J J 

j-i 

k=l 

j=l  k=l 

N 

N 

S Aj  Xj 

- 23 

(23 

*5 

)Xj 

= s s 

j=l 

j-i 

k=l 

j=l  k=l 

rk* 


A = b 


J j 


so  X = (X  J ,X2  ,...,X^  ) is  a solution  to  (LP) 


N 


with  value 


S S xr. 


j=l  k=l 


J J 


QED 


2.7  A Solution  Procedure  for  the  Two  Duty  Period 
Scheduling  Problem 


Proposition  2.1  has  shown  that  (P*)  in  Section  2.5 


equivalent 

to  (P). 

The  ref  ore. 

a relaxation  of  (P) 

(PR) 

Min 

C^X  ^ ♦ 

c^x' 

st 

TA^  X ^ + 

TA^  X ^ = b 

which  is  simply  a network  flow  problem.  Since  our 
original  problem  (P)  , the  two  duty  period  scheduling 
problem,  is  an  integer  programming  problem;  we  can 
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incorporate  the  relaxation  (PB)  into  a branch  and  bound 
implicit  enumeration  strategy.  The  following  algorithm, 
in  the  notation  of  Geoffrion  and  Harsten  (1972) , 
outlines  a solution  procedure  for  the  two  duty  period 
scheduling  problem. 

Algorithm  2. 1 

STEP  0 Initialize  value  of  the  best  solution  so  far: 
INCUMBENT  = INFINITY 
Initialize  the  level  of  the  tree  search: 

LEVEL  = 0 

Let  the  initial  candidate  problem  be  the 
original  problem: 

(CP)  = (P) 

STEP  1 Solve  (CPR)  , the  relaxation  of  (CP)  . 

If  VALUE  (CPi;)  L INCUMBENT,  GO  TO  STEP  2 
If  (CPE)  solution  is  feasible  in  (P)  then 
set  INCUMBENT  = VALUE  (CPF)  and 
GO  TO  STEP  2 
LEVEL  = LEVEL  ♦ 1 
GO  TO  STEP  2 
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STEP  2 


Create  the  next  Candidate  Problem  (CP)  at 


this  level  of  the  tree  search. 

If  none  exists,  GO  TO  STEP  3 
. GO  TO  STEP  1 

STEP  3 IF  LEVEL  = 0,  STOP 
Back  up  in  tree: 

Set  LEVEL  = LEVEL  - 1 
GO  TO  STEP  2 

The  relaxation  (PR)  can  be  used  with  Algorithm  2.1 
for  a solution  procedure  for  the  two  duty  period 
scheduling  problem.  For  a significantly  different  ap- 
proach to  utilizing  network  relaxations  for  solving  set 
partitioning  problems  see  Mulvey  (1975)  and  Glover  and 
Mulvey  (1976)  . 

Mulvey  (1975)  reformulates  set  partitioning  problems 
as  transportation  problems  with  side  constraints  (or 
equivalently  as  a generalized  network  problem).  The 
number  of  arcs  in  the  network  flow  problem  is  equal  to 
the  number  of  nonzero  entries  in  the  constraint  matrix 
of  the  original  set  partitioning  problem.  Glover  and 
\ Mulvey  (1976)  also  reformulate  the  two  duty  period 

I ‘ 


69 


scheduling  problem  as  a network  problem  with  side 


constraints.  For  each  variable  with  two  segments  they 

create  two  new  nodes  U and  V and  an  "all-or-none"  arc 

j j 

joining  then.  Rather  than  solve  this  network  related 
version  of  the  problem  directly,  the  authors  propose  a 
heuristic  solution  technique. 


The  relaxation  (PR)  is  very  weak,  to  the  point  of 
being  useless.  However,  we  have  great  freedom  in 
choosing  [C^,C^];  the  only  stipulation  being 

= C.  As  we  shall  see  in  Chapter  4,  by  choosing 
[C^,C^]  properly,  we  can  greatly  strengthen  the  relaxation 
(PR)  . In  fact  with  a moderate  effort  we  can  get  (PR)  to 
be  almost  as  strong  as  the  LP  relaxation  of  (P) . 
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PART  II  laplenentation 


I CHAPTER  3 

i' 

i: 

THE  HELSINKI  PROBLEJl  - THE  RELAXATION 

In  this  chapter  ve  will  discuss  a special  case  of 
the  two  duty  p<?riod  scheduling  problem  - the  two  duty 
period  set  partitioning  problem  (or  the  Helsinki 
problem) . The  two  segment  characteristic  of  the  problem 
allows  us  to  use  a minimum  cost  flow  problem  as  a 
relaxation.  The  set  partitioning  characteristic  means 
this  relaxation  is  simply  a shortest  path  problem.  In 
addition,  because  the  Helsinki  problem  is  a set 
partitioning  problem,  we  can  reduce  its  dimensions  by 
logical  considerations.  In  Section  3.3  we  present  an 
algorithm  for  solving  the  two  duty  period  set 
partitioning  problem.  In  the  next  section  we  develop  a 
branching  procedure  to  be  incorporated  into  this 
algorithm.  In  Sections  3.5  and  3.6  we  introduce  an 
algorithm  for  solving  the  shortest  path  relaxation. 
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3.1 


The  Helsinki  Problem  as  a Shortest  Path  Problem 


with  Coupled  Arcs 

A special  case  of  the  two  duty  period  scheduling 
problem  is  the  Helsinki  Problem  (HP) 

(HP)  Min  CX 

St  AX  = 1 

X = 0, 1 

where  1 is  the  M dimensional  vector  of  ones  and  where 
each  column  of  A consists  of  one  or  two  segments  of 
ones.  For  example,  see  Figure  3.1.  It  was  in  this 
form,  a set  partitioning  problem,  that  Harkku  Tamminen 
of  the  Data  Center  of  the  Helsinki  City  Metropolitan 
Area,  first  brought  the  two  duty  period  scheduling 
problem  to  the  author's  attention.  The  Helsinki  problem 
arose  in  trying  to  schedule  the  city's  bus  crews  using 
mathematical  programming. 

Applying  to  (HP)  the  transformations  developed  in 
Chapter  2 yields: 
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(HP*) 

Min 

CY  Y ♦ Z 

St 

(Y,Z)  in  S 

lY  - IZ  = 0 

where  S is  the  set  of  solutions  to  a shortest  path 
probleci.  For  notational  convenience  we  have  replaced 
each  variable  with  (rather  than  X^)  and  (rather 
than  Xj) . See  Figure  3.2.  Using  the  terminology  of 
Geoffrion  and  Marsten  (1972)  we  may  write  the  relaxation 
of  (HP*)  as 

(HPR)  Min  Y + Z 

St  (Y,Z)  in  S 

Then  a feasible  strategy  for  solving  (HP)  is  to 
incorporate  the  relaxation  (HPR)  into  a branch  and  bound 
procedure,  such  as  Algorithm  2.1.  The  relaxation  (HPR) 
is  a shortest  path  problem  with  N 4^  1 nodes  and  (no  more 
than)  2N  arcs. 

Me  employ  the  following  terminology.  He  say  that  a 
column  X j (HF)  corresponds  to  two  VARIABLES  or  ARCS 

Yj  and  ^ ^ (HP*)  and  (HPR).  Re  refer  to  the  coupled 

arcs  Y^  and  Z as  PARTNERS.  Two  variables  (of  (HP*)) 
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The  Helsinki  Problem  of  Figure  3.1  Reformulated  with  Decoupled 

Arcs 

Figure  3.2 
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CONFLICT  if  there  is  a row  covered  ty  both  variables 
(i.e.  each  column  has  a one  in  the  same  row).  We  use 
the  terms  RELAXATION  and  SHORTEST  PATH  PROBLEM 
interchangeably.  Similarly  the  terms  VARIABLE  (of 
(HP*))  and  ARC  are  used  interchangeably,  as  are  ROH  and 
NODE  (of  the  shortest  path  problem) . Consequently,  a 
variable  (of  (HP*) ) covers  a row  if  and  only  if  the 
corresponding  arc  starts  in  or  passes  over  that  row.  An 
arc  starts  at  a node  if  and  only  if  the  corresponding 
segment  of  ones  in  the  original  Helsinki  problem  starts 
at  the  corresponding  row.  An  arc  ends  at  a node  if  and 
only  if  the  corresponding  segment  of  ones  in  the 
original  Helsinki  problem  ends  at  the  row  preceding  the 
corresponding  row. 

3.2  Logical  Elimination  for  the  Helsinki  Problem 

Algorithm  2. 1 can  be  reformulated  more  specifically 
taking  into  account  the  special  features  of  both  the  set 
partitioning  problem  (HP*)  and  the  shortest  path 

i 

relaxation  (HPR).  Because  the  original  problem  (HP*)  is  | | 

a set  partitioning  problem,  we  know  that  once  a variable  | j 


is  chosen  (i.e.  set  equal  to  one) , all  variables  which 


r 


^ ^ conflict  with  it  may  be  eliminated  (i.e.  set  equal  to 

I zero) . 

ti 

it 

Similarly,  we  can  take  advantage  of  the  fact  that 
the  relaxation  is  a shortest  path  problem  and  that  it  is 
obtained  by  decoupling  arcs.  Consequently,  what  is  true 
of  one  arc  is  true  of  its  partner.  Therefore,  when  an 
arc  is  chosen,  we  can  eliminate  all  the  arcs  which 
conflict  with  its  partner  as  well  as  itself.  Of  course, 
when  an  arc  is  eliminated,  its  partner  is  also 
eliminated. 

From  the  shortest  path  formulation  it  can  be  seen 
that  if  no  arcs  enter  a node,  then  all  arcs  leaving  that 
node  may  be  eliminated.  This  is  simply  because  uo  arc 
leaving  that  node  can  participate  in  a solution  to  the 
shortest  path  problem.  Similarly,  if  no  arcs  leave  a 
node,  all  arcs  entering  that  node  may  be  eliminated. 


3.3  An  Algocitha  £oc  Solving  the  Helsinki  Problem 

Algorithm  2. 1 outlines  the  general  procedure  for  a 
tree  search.  It  is  an  interesting  feature  of  the 
algorithm  which  we  will  now  develop  that  a node  in  the 
search  tree  will  often  have  more  than  two  successor 
nodes.  At  any  given  node,  rather  than  choosing  a single 
separation  variable  (SEPVAE)  to  branch  on,  our  algorithm 
will  choose  a number  or  variables  in  a manner  we  will 
describe  in  Section  3.4.  They  will  correspond  to  the 
arcs  starting  at  a given  node  in  the  shortest  path 
relaxation.  Each  branch  at  a node  will  correspond  to 
including  exactly  one  of  these  arcs.  See  Figure  3.3 
corresponding  to  the  example  in  Figure  3.2. 

He  have  employed  some  special  notation  in  Figure 
3.3.  Each  circle  corresponds  to  a node  of  the  search 
tree.  The  number  within  each  circle  represents  the 
level  on  the  search  tree  (i.e.  the  number  of  separation 
variables  which  have  been  chosen  in  reaching  that  node) . 
Each  arc  in  Figure  3.3  corresponds  to  a branch.  The 
label  on  each  arc  indicates  the  separation  variable  at 
that  branch.  For  example,  the  label  T^,  on  the  branch 


i 
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leading  to  the  second  node  on  level  1,  indicates  that 
vas  set  equal  to  one  at  that  branch. 

The  unusual  aspect  of  the  tree  in  Figure  3.3  is 
that  each  node  has  associated  with  it  a height  as  veil 
as  a level.  The  row  numbers  on  the  left  hand  side 
indicate  a node's  height.  Tne  height  of  a node 
indicates  the  first  row  of  (HP*)  which  has  not  been 
covered  by  the  separation  variables  chosen  in  reaching 
that  node.  In  terms  of  the  shortest  path  interpretation 
of  (HP),  choosing  a separation  variable  to  be  in  the 
solution  corresponds  to  choosing  an  arc  to  be  in  the 
path.  Consequently,  the  row  number  or  the  height  of  a 
node  in  the  search  tree  corresponds  to  the  node  in  the 
shortest  path  solution  which  has  been  reached  as  a 
result  of  choosing  separation  variables  (or  arcs) . For 
example,  choosing  Y^  to  branch  on  at  level  0 brings  us 
to  a node  at  level  1 and  height  3..  Row  3 is  the  first 
row  not  covered  by  Y^.  And  the  arc  Y3  ends  at  node  3 in 
the  shortest  path  interpretation.  At  any  node  of  the 
search  tree  each  branch  corresponds  to  an  arc  which 
begins  in  the  first  row  not  covered  by  the  separation 
variables  chosen  so  far  (that  row  is  just  the  height  of 
the  node). 


■/ 


80 


Algorithm  3. 1 reiterates,  with  greater  specificity. 
Algorithm  2.1. 


Algorithm  3.  1 

STEP  0 INCUMBEi^T  = IHFIHITY 

LEVEL  = 0 
(CP)  - (HP) 

(CPR)  = (HPfi) 

STEP  1 Solve  (CPR) 

IF  VALUE  (CPR)  ^ INCUUBEHI,  GO  TO  STEP  2a 
IP  (CPR)  solution  satisfies  Y = Z then 
set  INCUMBENT  = VALUE  (CPR)  and 
GO  TO  STEP  2a 
LEVEL  = LEVEL  ♦ 1 
GO  TO  STEP  2b 

STEP  2a  Set  SEPVAR (LEVEL)  = PARTNER (SEPVAR (LEVEL) ) = 0 
Reinstate  those  variables  eliminated  at  this 
LEVEL  because  they  or  their  partners 
conflicted  with  SEPVAR (LEVEL)  or  its 
part ner. 
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STEP  2b  Create  the  next  Candidate  Probien  (CP) : 


Choose  a new  separation  variable  for  this  level, 
SEPVAR (LEVEL) 

If  none  exists,  GO  TO  STEP  3 
Eliminate  (i.e.  set  equal  to  zero)  all 
variables,  and  their  partners,  which 
conflict  with  SEPVAK  (LEVEL)  and 
PARTNER (SE?V  AR (LEVEL) ) 

Set  SEPVAR  (LEVEL)  = PAETKEF (SEPVAR (LEVEL) ) = 1 
GO  TO  STLP  1 

STEP  3 IF  LEVEL  = 0,  STOP 
Back  up  in  tree: 

Reinstate  all  the  separation  variables 
eliminated  at  this  LEVEL 
Set  LEVEL  = LEVEL  - 1 
GO  TO  STEP  2a 


3.4  The  Branching  Procedure 

The  shortest  path  interpretation  of  the  Helsinki 
problem  suggests  a natural  branching  procedure.  The 
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original  problen  (HP)  can  be  considered  one  of  trying  to 
construct  a shortest  path  such  that  for  each  arc 
included  in  the  path  its  partner  is  also  included. 
Consider  then,  the  branching  procedure  as  one  of  trying 
to  construct  such  a path.  At  the  top  level  of  the  tree 
solve  the  relaxation.  Then  choose  as  the  separation 
variable  the  arc  in  the  relaxation  solution  which  begins 
at  node  1.  See  Figure  3.3. 

A similar  process  is  used  to  choose  a separation 
variable  at  all  nodes  of  the  decision  tree.  Each  such 
node  corresponds  to  a partial  solution  (the  set  of 
variables  fixed  to  be  in  the  solution  by  choosing 
separation  variables  at  each  level  in  reaching  this 
node) . To  choose  the  next  separation  variable,  solve 
the  relaxation.  Then  choose  the  first  arc,  say  in 
the  shortest  parh  solution  that  has  not  already  been 
chosen  as  a separation  variable  in  reaching  the  present 
node  of  the  decision  tree.  Then  is  the  natural 
candidate  for  the  separation  variable.  At  the  next 
level  set  Y^  (and  Z^)  equal  to  one  and  repeat  the 
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Rhen  a node  is  fathomed  the  separation  variable  at 
the  present  level  is  eliminated  and  the  variables  which 
were  eliminated  at  this  level,  because  they  conflicted 
with  this  separation  variable,  are  reinstated  (i.e. 
they  become  free  variables  with  no  fixed  value).  The 
relaxation  is  resolved  and  a new  separation  variable  is 
chosen,  if  the  relaxation  is  feasible,  in  the  manner 
described  above. 

This  procedure  is  equivalent  to  binary  branching 
(two  branches  from  each  node,  one  with  the  separation 
variable  set  equal  to  one,  and  the  ether  equal  to  zero) 
in  the  sense  that  choosing  the  next  separation  variable 
at  a level  is  equivalent  to  making  a branch  with  the 
(previous)  separation  variable  set  equal  to  zero. 


3.5  Solving  the  Relaxation 

Step  2 of  Algorithm  3.1  greatly  reduces  the  size  of 
the  candidate  problem.  Consider  the  example  of  Figure 
3.2.  The  corresponding  shortest  path  problem  is  given 
in  Figure  3.4.  Since  Node  2 has  no  exit,  we  may 
eliminate  and  its  partner  (see  Figure  3.5). 


On 


the  first  branch,  if  ve  choose  as  the  separation 
variable,  then  we  may  eliminate  those  arcs  conflicting 
with  either  Y^  or  (see  Figure  3.6).  Similarly,  at 
the  second  possible  branch  from  level  1,  we  may  choose 
Y^  as  the  separation  variable.  Conseguently,  we  may 


eliminate  those  variables  (and  their  partners) 


conflicting  with  either  Y^  or  Z^  (see  Figure  3.7). 


In  the  same  manner,  we  can  reduce  the  problem  size 
by  logical  elimination  at  Step  1 of  Algorithm  3.1,  while 
solving  the  relaxation  problem  (CPR).  Our  relaxation  is 
a shortest  path  problem  which  we  will  solve  with  a 
labeling  algorithm.  Consider  the  following  forward 
reaching  algorithm  for  solving  a shortest  path  problem 
with  M ♦ 1 nodes  [Denardo  & Fox  (1977)  ]: 


Algorithm  3.2 

STEP  0 Let  FORWARD  LABEL (NODE)  = INFINITY  for  each 

node  except  NODE  = 1,  FORWARD  LABEL  (1)  = 0. 
Let  NODE  = 1. 


I 


Choose  as  separation  variable  and  eliminate  those  arcs 
conflicting  with  (Yj),  Y^  and  their  partners:  (Z^) , 


Yj  chosen,  so  eliminate  arcs  conflicting  with  its  partner  Z 


Branch  1 Level  0 


Figure  3.6 


STEP  1 For  each  arc  which  begins  at  NODE,  if 
FOfiWARO  LAB£L(NOOE)  * COST (ABC)  is  less 
than  the  FORVABD  LABEL  for  ENDABC,  the 
node  at  which  ARC  ends,  replace  FORWARD 
LABEL  of  EMDAEC  with  FORWARD  LABEL  (NODE)  ♦ 
COST  (ARC)  . 

STEP  2 Let  NODE  = NODE  ♦ 1 . If  NODE  = M ♦ 1 , STOP. 

STEP  3 If  FORWARD  LABEL  (NODE)  < INFINITY,  GO  TO  STEP  1. 

GO  TO  STEP  2 


Similarly  there  is  a backward  reaching  algorithm. 
See  Figure  3.8. 


3.6  An  Iterative  Backward-Forward  Algorithm  for 
Solving  the  Relaxation 

Note  that  if  the  branch  and  bound  scheme  has 
already  found  a feasible  solution  there  is  an  INCUMBENT 
< INFINITY.  In  step  3,  INFINITY  may  be  replaced  by 
INCUMBENT,  making  the  algorithm  stronger.  However,  an 
even  stronger  test  may  be  applied  at  step  3. 
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Forward  Reaching: 


I O 0 o o © o 

Generates  a set  of  forward  labels  for  each  node  j,  j = 1,2 9, 

1 

I Indicating  the  cost  of  the  cheapest  path  from  node  1 to  node  j. 

Backward  Reaching: 

0 0 0 0 0 0 

Generates  a set  of  backward  labels  for  each  node  j,  j = 1,2,... 9, 
indicating  the  cost  of  the  cheapest  path  from  node  j to  node  9. 

Shortest  Path  Algorithms 

Figure  3.8 
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Suppose  the  relaxatioo  is  solved  first  usio^  the 
backward  reaching  algorithoi.  This  would  result  in  each 
node  having  a BACKWARD  LABEL  indicating  the  cost  of  the 
cheapest  path  from  that  node  to  the  H * 1st  (last)  node. 
Now  consider  the  implications  for  a subsecuently  applied 
forward  reaching  algorithm  to  the  relaxation  . The  test 
at  step  3 may  become 

If  FORWARD  LABEL (NODE)  ♦ BACKWARD  LABEL  (NODE)  < 
INCUMBENT,  GO  TO  STEP  1 

This  test  checks  each  node,  to  see  if  it  is 
possible  for  an  arc  ending  or  starting  at  that  node  to 
participate  in  a solution  better  than  the  incumbent. 

The  new  test  at  step  3 may  allow  us  to  eliminate  all 
arcs  (and  their  partners)  which  begin  at  the  node  being 
considered. 

However,  because  of  the  coupled  arc  nature  of  our 
shortest  path  relaxation,  it  is  better  to  instead  add  a 
new  test  at  step  1: 

If  FORWARD  LABEL  (NODE)  * COST  (ARC)  BACKWARD 
LABEL  (ENDARC)  > INCUMBENT,  eliminate  ABC 


This  new  test  at  step  1 may  allow  the  ABC 
considered  to  be  eliminated  since  it  cannot  participate 
in  a solution  of  the  (CPR)  better  than  the  INCUMBENT. 
When  the  arc  is  eliminated,  its  partner  is  also 
eliminated.  This  new  test  at  step  1 makes  the  new  test 
at  step  3 redundant.  See  Figure  3.9. 

The  advantage  of  these  new  bounding  tests  is  that 
they  may  eliminate  arcs  (and  conseguently  their 
partners).  The  tests  reduce  the  problem  size  and 
tighten  the  relaxation  by  eliminating  partner  arcs. 

Of  course  the  forward  labels  can  be  used  in 
strengthening  the  backward  reaching  algorithm  in  a 
similar  manner.  An  iterative  process  of  solving  the 
shortest  path  problem  by  backward  and  forward  reaching 
can  be  used  until  equilibrium  is  reached,  (i. e.  until  no 
arcs  are  eliminated  in  a full  cycle  of  the  algorithm  - 
one  backward  pass  and  one  forward  pass). 

Algorithm  3.3  is  the  solution  procedure  for  solving 
the  shortest  path  relaxation  (CPB)  in  Algorithm  3.  1, 
which  incorporates  these  ideas.  FOBUARD  (BACKWARD)  STOP 
is  a flag  to  let  us  know  whether  or  not  the  last  forward 
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Arc 


Forward  Label 


Backward  Label 


for  node  4 


for  node  6 


Y may  be  eliminated  if  FORWARD  LABEL(4)  + COST(Y  ) 
^ j 

BACKWARD  LAB EL (6)  is  greater  than  INCUMBENT 


Iterative  Backward-Forward  Reaching 


Figure  3.9 
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(backward)  pass  of  the  algorithm  eliminated  any 
variables.  Consequently/  when  FOHNABD  STOP  = BACKTAB D 
STOP  = 1/  the  algorithm  is  finished. 

Algorithm  3.3 

STEP  0 Let  BACKWABD  LABEL (NODE)  = 0 ior  all  NODES 
BACKKABD  STOP  = 0 

STEP  1 Let  FOBWAED  LABEL (NODE)  = INFINITY  for  each  node 
except  NODE  = 1,  FOBKARD  LABEL (1)  = 0. 

Let  NODE  = 1. 

FOBWAED  STOP  = 1.. 

STEP  2 Choose  an  ABC  which  begins  at  NODE 
If  none  exists  go  to  STEP  4 
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STEP  3 


STEP  5 


If  FORWAED  LABEL  (NODE)  ♦ COST  (ARC)  ♦ 

BACKHARD  LABEL ( EHDARC)  > INCUMBENT, 
eliolnate  ABC,  PARTNER (ARC)  , set 
FORWARD  STOP  = 0,  GO  TO  STEP  2. 

If  FORWARD  LABEL  (NODE)  + COST  (ARC)  < 
FORWARD  LABEL(ENDAEC) , FORWARD 
LABEL  (ENDARC)  = FORWARD  LABEL  (NODE) 

♦ COST  (ARC) 


GO  TO  STEP  2. 


STEP  4 Let  NODE  = NODE  + 1 


If  NODE  = K + 1 (the  last  node) , GO  TO  STEP  6. 


If  FORWARD  LABEL(NODE)  + BACKWARD  LABEL (NODE) 

< INCUMBENT,  GO  TO  STEP  2 
Eliminate  all  arcs  and  their  partners  which  begin 

at  NODE.  If  any  arcs  are  eliminated  at  this  step, 
set  FORWARD  STOP  = 0 
GO  TO  STEP  4 


STEP  6 If  FORWARD  LABEL (M  ♦ 1)  > INCUMBENT,  STOP 

If  shortest  path  solution  satisfies  Y = Z,  STOP 
If  FORWARD  STOP  = BACKWARD  STOP  = 1,  STOP. 
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STEP  7 Let  BACKWARD  LABEL (NODE)  = INFINITY  for  each 
node  except  NODE  = M ♦ 1, 

BACKWARD  LABEL  (d  1)  = 0. 

Let  NODE  = M ♦ 1 
BACKWARD  STOP  = 1 

STEP  8 Choose  an  arc  which  ends  at  NODE. 

If  none  exists  go  to  Step  10. 

STEP  9 If  BACKWARD  LABEL (NODE)  + COST  (AFC)  + 

FORWARD  LABEL (BEGIN  ARC)  i 

INCUMBENT,  Eliminate  ARC,  PARTNER (AFC)  , 

set  BACKWARD  STOP  = 0,  GO  TO  STEP  8. 

If  BACKWARD  LABEL  (NODE)  ♦ COST  (ARC)  < 
BACKWARD  LABEL(BEGIN  ARC), 

BACKWARD  LABEL (BEGIN  ARC) 

= BACKWARD  LABEL  (NODE)  ♦ COST  (ARC) 

GO  TO  STEP  8. 


STEP  10  Let  NODE  = NODE  - 1 

IF  NODE  =1,  GO  TO  STEP  12 
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STEP  11  If  BACKRARD  LABEL  (NODE)  FORRABD  LABEL  (MODE) 
< INCOMBENT,  GO  TO  STEP  8 


EliBinate  all  arcs  and  their  partners  which 

end  at  NODE.  If  any  arcs  are  eliminated  at  this 
step  set  BACKWARD  STOP  = 0. 

GO  TO  STEP  10 

STEP  12  If  BACKWARD  LABEL (1)  i INCUMBENT,  STOP 

If  shortest  path  solution  satisiies  Y = Z,  STOP. 

If  FORWARD  STOP  = BACKWARD  STOP  = 1,  STOP. 

GO  TO  STEP  1. 


A 


In  applying  the  methods  of  this  chapter  it  was 
found  that  the  shortest  path  relaxation  is  too  weak. 

This  HELSINKI  algortihm  did  not  compare  favorably,  in 
solving  large  problems,  with  existing  more  general 
algorithms  such  as  Marsten's  Set  Partitioning  Algorithm 
(SETPAR).  The  HELSINKI  algorithm  was  effective  in 
reducing  the  size  of  the  problem  (by  roughly  60%  at  the 
second  level  of  the  tree,  80%  at  the  third  level  and  90% 
at  the  fourth  level  . However,  the  relaxation  was  so 
weak  that  most  nodes  in  the  tree  were  fathomed  by 
infeasibility  (due  to  eliminating  variables)  rather  than 
by  bounding.  Consequently,  the  algorithm  generated  a 
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huge  tree.  In  spite  of  the  speed  with  which  the 
shortest  path  subprobleos  could  be  solved;  the  entire 
I Helsinki  problem  could  not  be  solved  in  a reasonable 

I v amount  of  time. 

! 

j In  order  to  overcome  this  difficulty,  we  took  steps 

' to  tighten  the  relaxation.  These  will  be  discussed  in 

the  next  chapter. 
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TIGHTENING  THE  RELAXATION 

In  this  chapter  we  construct  a Lagrangean 
relaxation  of  the  Helsinki  problem  by  dualizing  with 
respect  to  the  coupling  constraints.  This  results  in  a 
relaxation  that  is  a shortest  path  problem  with  arc 
lengths  that  depend  on  the  current  values  of  the 
Lagrange  multipliers.  Using  subgradient  optimization  to 
maximize  the  Lagrangean  leads  to  an  interpretation  of 
seeking  an  optimal  allocation  of  the  costs  of  the 
original  variables  between  their  decoupled  arcs.  In  the 
final  section  we  show  that,  by  using  this  technique  to 
tighten  our  relaxation  we  can  reduce  the  size  of  the 
search  tree  generated  by  the  branch  and  bound  algorithm 
of  the  previous  chapter. 


4.1  Subgradient  Optimization  and  the  Helsinki  Problem 


Consider  the  Helsinki  problem: 


100 


(HP)  Min  y ♦ Z 

St  (Y,Z)  in  S 
lY  - IZ  = 0 


Y,Z  = 0,1 


where  S is  the  set  of  shortest  path  solutions. 

By  multiplying  lY  - IZ  by  the  dual  variables  0 and 
adding  the  result  to  the  objective  function,  we  can  form 
the  Lagrangean  relaxation  of  (HP)  [Everett  (1963), 
Geoffrion  (1974)  ]; 


(HPR  ) 

U 

V (U)  = Min 

Y + Z ♦ U (Y  - Z) 

St 

(Y,Z)  in  S 

Y,Z  = 0,1 

Note  that  (HPPy)  has  the  property  that  it  is  not  altered 
by  dropping  the  integrality  conditions  on  Y and  Z (indeed 
(Y,Z)  in  S guarantees  Y,Z  = 0,1).  This  feature  is  known 
as  the  INTEGRALITY  PROPERTY  [Geoffrion  (1974)].  Then 
(HPRy)  can  be  rewritten  as 
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(HPRy)  H(U)  = Min  (C^  ♦ U)y  ♦ (C^  - 0)Z 

St  (y,Z)  in  S 

Note  that  W (U)  is  a piecewise  linear  concave  function 

since  it  is  the  pointwise  ninimuo  of  a family  of  linear 

functions  of  0.  Since  U (U)  has  the  integrality  property 

Max  H(U)  is  equivalent  to  the  linear  programming 
U 

relaxation  of  the  Helsinki  problem  £Geo£frion  (1974)],. 
Geoff rion  has  shown  that  Lagrangean  relaxation  can  be 
very  effective  when  used  in  branch  and  bound  algorithms. 

Held,  Wolfe  and  Crowder  (1974)  have  shown  that 
using  a technique  now  known  as  subgradient  optimization 
is  effective  in  maximizing  the  Lagrangean.  Consider  our 

A A A 

function  W((J).  Suppose  (Y,Z}  is  optimal  for  0: 

A mm  A ^ A 

H (U)  = (C  * ♦ U)  Y ♦ (C  - U)  Z 


He  define  s to  be  a SUBGKADIENI  of  H at  U if  and  only  if 


H (U)  < H (U)  ♦ (D  U)  s 


for  all  0.  Then  the  vector  s at  U points  into  the  half 
space  (orthogonal  to  s)  which  contains  all  better 
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(larger)  solutions  to  H.  Consider  a 0*  such  that  ll(U') 

> H(U) , then 

H(U')  > H(D)  > H(O')  - (U*  - U)s 

= > (a*  - ff)s  > 0 

which  means  (U ' - 6)  forms  an  acute  angle  with  s..  This 
implies  that  U'  is  in  the  half  space  into  which  s points 
at  0. 

Consequently,  a subgradient  can  be  considered  a 
good  direction  to  go  in  search  of  a higher  value  of  W. 
Consequently,  we  will  maximize  W iteratively.  At  each 
iteration  we  choose  a new  U^+l  by  moving  from  our  last 
U ^ in  the  direction  st.  That  is 

sr  0^  ♦ a*^s*^ 


where  a^  is  the  step  length. 


Now,  we  show  that  (Y  - Z)  is  a subgradient  of  H at 
6.  Recall 


^ AA  A 

H(U)  = C^'y  + crz  ♦ U(Y  - Z) 
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Now 


H(U)  = Min  C^Y  ♦ C?Z  ♦ U(Y  - Z) 
St  (Y,Z)  in  S 


=>  U (U)  4 C ^ Y + Z ♦ U (Y  - Z) 


=>  U (U)  4 C ^ Y ♦ Z ♦ U (Y  - Z) 

A ^ A A ^ 

U (Y  - Z)  ♦ U (Y  - Z) 


=>  H(U)  4 W(0)  ♦ (U  - U)  (Y  - Z) 


which  is  just  the  definition  of  (Y  - Z)  as  a subgradient 
of  W at  0.  Notice  that  our  proof  of  this  did  not  depend 
on  the  constraints  (Y  - Z)  put  into  the  objective 
function.  A general  result  holds  that  whenever  a 
Lagrangean  relaxation  is  formed,  if  constraints  AX  4 b 
ace  put  into  the  objective  function  then  the  vector 
(AX  - b)  is  a subgradient  for  the  Lagrangean. 


Take  (HPB^)  as  the  relaxation  of  the  Helsinki 
problem  (HP).  In  particular  (HPBq)  is  the  Shortest  Path 
Relaxation  obtained  in  chapter  2 by  decoupling  the  arcs. 

A A ^ 

However,  since  Y - Z is  a subgradient  of  H at  0,  the 
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relaxation  oay  be  tightened  by  utilizing  subgradient 

optimization  technigues  [Agmon  (1954),  notzkin  & \ 

Schoenberg  (1954),  Poljak  (1967),  Coffin  (1971,  1976), 

Held  6 Karp  (1971),  Held,  Wolfe  S Crowder  (1974), 

Fisher,  Northup  & Shapiro  (1975)  }.  In  this  way  the 
Lagrangean  relaxation  value  should  approach  the  value  of 
the  linear  programming  relaxation. 

He  use  the  following  version  of  subgradient 

optimization.  Let  H be  the  TARGET  VALUE,  a guess  at  the 

optimal  value  of  Max  W(U)-  Define  ♦ a*^s*^ 

u 

where  s is  the  subgradient  at  iteration  t and  a ^ is  the 
step  length.  Since  W(u'^‘'’M  < H (U*^)  ♦ - U •^)  s*^  we 

may  consider  W (U  ^ ) ♦ (U^"*"^  - U ^ ) s to  be  an  upper 
bound  or  approximation  for  H (U*'"''^  ),  Then,  since  we  want 

to  reach  our  target  value  H,  pick  U so  that  our 

. ^ t+1  t t t 

approximation  will  equal  H.  Choose  U = U a s 

(i. e.  choose  a^,  since  0^  and  s^  are  already 

determined)  to  satisfy  H = H(U*^)  ♦ (U  - U )s  = H(U  ) 

♦ at  |st|2  , Then  a*^  = [H  - H(o‘^)]/  Is'^l^  - 

Computationally,  solving  for  a^  and  U^^^  at  each 

iteration  is  very  simple.  Since  s = Y - Z , (where 

(Y^,Z  *)  is  optimal  in  W (U^  ) ) , |s*^l^  is  just  equal  to 
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the  nuBber  of  arcs  in  the  shortest  path  solution  whose 
partners  are  not  included  in  the  solution. 


In  practice,  in  order  to  assure  convergence, 
we  will  include  a multiplier  d^  , 0 < d^  £ 2,  in 
computing  a*^  [Held,  Wolfe  & Crowder  (1974)] 

a = d*'  [ W - W (u*^)  ]/  |st|2 

4.2  Interpreting  the  Subgradient  Optimization  Iteration 
for  the  Helsinki  Problem 

Conceptually  then,  each  iteration  of  the  subgradient 
optimization  merely  reallocates  the  cost  C^  of  Xj 
between  Ij  and  Zj  . 

N 

W(U)  = Min 

(Y,Z)  in  S ^ 

♦ d*^[H  - W(U*')  ]s^/ 

♦ d’^CW  - H(D^  ](Y  J - 2*^  )/  I s*^|^ 

J J 
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so 


= Cj  ♦ oj  ♦ d*'[H  - - zj  )/  Is*"!^ 

= (c]  )^  ♦ 


d’^CiT  - IJ(U‘=)  ](y|  -zS/  13*^1 

J J 

and  = (C^  )•'  - d*'tw  - H(a’')J(yj  - )/  Is*) 

If  we  let  P*  be  the  penalty  at  iteration  t + 1 
P*  = d*[W  - W(0*)  ]/  I 3*1^ 


Then  (C^ 


Y . t+1 


(Cj  ) 


And 


Z t+1 


(Cj  ) 


‘-=1 

+ 

t 

p'd' 

t 

- ^5' 

+ 

p' 

^ t 

if  Yj 

= 

1 

and 

, , 

If  Yj 

= 

- 

p * 

^ t 
i£  Yj 

= 

0 

and 

= 

(Cj 

p* 

''t 

if  Yj 

= 

1 

and 

£*  = 

if 

= 

z* 

,cj 

+ 

p* 

•^t 
if  Yj 

= 

0 

and 

z'^  = 

For  a variable  , if  the  partnern  and  Z^  are 
both  included  in  the  sJ ortent  path  relaxation  nolution  or 
are  both  not  included  there  is  no  adjustment  in  the 


allocation  of  C ^ to  C 


J 


and  . However, if  one  partner 
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(say  Tj  ) is  included  and  the  other  j ) not,  then  the 
included  partner  is  given  a higher  cost  [ (C^)^  ♦ P *^  ] to 
■ake  it  less  attractive  and  the  excluded  partner  is  given 
a lower  cost  [ (C^)  ^ J to  naJce  it  more  attractive. 

The  algorithm  tries  to  make  partners  equally  desirable 


so  that  if  one  is  chosen  the  other  is  also  chosen 


4.3  Choosing  the  Target  Value 


In  implementing  subgradient  optimization  there  are 
a number  of  degrees  of  freedom.  Generally  is 


It  has  been  shown  [Held,  Holfe  & Crowder  (1974)  ] that  the 


algorithm  will  converge  to  Max  H(U)  if  0 < d — 2, 

d*^  ->  0 and  the  sum  of  the  d s diverges.  In  practice 


we  have  begun  with  d = 2 and  then  periodically 
reduced  it  by  half.  This  strategy  follows  that  of  Held, 


Holfe  and  Crowder  (1974) 


A further  choice  io  the  algoritha  is  the  selection 
of  the  target  value.  Solving  the  relaxation  serves  two 
purposes: 

1.  A relaxation  value  greater  than  the  incunbent 
allows  us  to  fathom  that  node  of  the  search 
tree 

2-*  A relaxation  solution  which  is  feasible  in  the 
original  problem  (HP)  also  allows  us  to 
fathom  that  node  of  the  tree.  This  is  because, 
in  our  Lagrangean  relaxation  W(U),  we  dualized 
with  respect  to  the  constraints  Y = Z.  Since 
these  constraints  are  equations,  finding  a 
feasible  answer  guarantees  complementary 
slackness. 

Rather  than  take  the  incumbent  as  the  target  value 

we  have  generally  taken  a value  somewhat  higner.  On  the 

assumption  that  Max  U (0)  is  greater  than  the  incumbent, 

U 

it  is  reasonable  that  H,  which  would  be  Max  H (U) 

u 

ideally,  be  greater  than  the  incumbent.  He  have  tried 
using  the  incumbent  as  well  as  higher  values.  He  have 
found  the  latter  to  be  significantly  more  effective  in 
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reaching  the  incumbent,  in  practice  we  have  used  120 X 
of  the  incumbent  as  the  target  value. 


i' 

!l 

I 4.4  Tree  Structure  - Bounding  a Set  of  Nodes 

* 

► 

[, 

The  Lagrangean  relaxation  approach  developed  here 

has  been  embedded  in  a tree  search.  Specifically  we 

have  used  it  as  the  relaxation  procedure  in  Algorithm 

I 3.3.  Computational  experience  with  this  algorithm  shows 

that  for  each  level  we  descend  in  the  tree  we  eliminate 

a large  proportion  of  the  variables.  Conseguently  in 

the  lower  levels  of  the  tree  we  are  dealing  with  a small 

subset  of  the  original  variables.  The  result  is  that 

the  subgradient  optimization  procedure  tends  to  derive 
Y Z 

costs  (C  , C ) for  the  uncoupled  arcs  that  are  not 
reasonable  in  other  parts  of  the  tree. 


In  order  to  combat  this  problem  we  have 
experimented  with  a number  of  implementation  options. 
Etcheberry  (1976)  has  reported  that  one  successful 
strategy  is  to  save  the  values  of  the  dual  variables 
at  each  level  of  the  tree  and  use  them  the  next  time 
that  level  is  visited.  Ne  have  tried  saving  the  dual 
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variables  and  utilizing  them  in  a number  of  ways.  At 
each  level  ve  tried  using  the  last  set  of  dual  variables 
obtained  at  that  level.  We  have  also  tried  using  the 
last  set  of  dual  variables  used  at  the  next  higher 
level.  In  addition  we  have  tried  a number  of  more 
exotic  (and  less  effective)  ways  of  using  the  stored 
dual  variables. 

However,  we  finally  settled  on  a procedure  which 

does  not  require  saving  dual  variables  at  all.  At  level 

0 we  begin  with  dual  variables  U®  = 0 and  apportion  the 

cost  of  each  variable  in  the  original  problem  between 

Cj  and  Cj  , proportional  to  the  number  of  rows  covered  by 

the  segment  of  ones  in  and  7,^  respectively.  For 

example  if  C . = 100,  Y . covers  3 rows  and  Z . covers  7 rows, 

then  we  set  = 30  and  = 70.  From  then  on  we  simply 
D D 

use  the  most  recently  generated  set  of  dual  variables  to 
determine  (cT,C^>  The  effect  of  this  is  to  use  the 
dual  variable  values  obtained  at  the  next  higher  level, 
as  ve  are  moving  down  in  the  tree.  When  a node  is 
fathomed,  before  choosing  a new  separation  variable  at 
that  level,  we  try  to  fathom  all  of  the  rest  of  the 
candidate  separation  variables  at  that  level  in  one  fell 
swoop  by  solving  the  relaxation  with  the  old  separation 
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variable  eliminated.  This  is  equivalent  to  doing  a 
branch  vith  the  old  separation  variable  set  equal  to 
zero,  hence  we  refer  to  it  as  binary  branching.  This 
binary  branching  serves  a number  of  purposes. 

First,  it  is  very  often  successful  in  eliminating 
all  of  the  rest  of  the  possible  branches  at  that  level. 
This  allows  us  to  immediately  move  back  up  in  the  tree 
one  more  level.  If  there  are,  say,  eight  more  candidate 
branches  at  the  current  level,  for  the  work  of  one 
branch  we  can  eliminate  all  eight. 

Second,  this  procedure  of  binary  branching  allows 
us  to  choose  the  next  separation  variable  in  a rational 
way.  Even  if  the  binary  branch  is  not  successful  in 
elioiinating  all  of  the  remaining  eligible  branching 
variables,  it  may  succeed  in  eliminating  some  of  them 
when  there  is  an  incumbent  solution  to  the  problem. 
Further,  the  best  solution  obtained  during  the 
subgradient  optimization  is  used  to  choose  the  next 
separation  variable.  This  best  solution  is  a path 
composed  from  the  decoupled  arcs.  Consequently,  we 
simply  use  as  a separation  variable  the  first  arc  in 
this  path  which  has  not  already  been  chosen  as  a 
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separation  variable  in  reaching  this  node  of  the  branch 
and  bound  tree. 


Third,  binary  branching  allows  a mechanism  to  bring 
the  dual  variables  in  line  with  the  optimal  ones  at  the 
current  level.  In  practice,  when  we  are  below  level  two 
in  the  search  tree,  we  do  not  do  a subgradient  opti- 
mization as  we  are  going  down  in  the  tree.  We  have 
found  that  the  improvement  in  the  dual  variables  gained 
is  not  enough  to  offset  the  extra  computation  involved. 
However,  we  do  continue  to  use  subgradient  optimization 
below  level  two  as  we  are  moving  up  (or  sideways)  in  the 
tree,  i.e.  whenever  we  perform  a binary  branch.  Con- 
sequently, once  past  level  two  it  is  very  easy  to  go 
very  far  down  in  the  tree  since  we  use  only  the 
relatively  weak  shortest  path  relaxations,  without 
subgradient  optimization.  Binary  branching  gives  us  a 
method  for  getting  back  up  in  the  tree  without 
examining  a prohibitive  number  of  nodes.  Rather  than 
examine  each  remaining  candidate  branch  individually  at 
a level,  we  are  often  able  to  fathom  them  all  at  once  by 
doing  a branch  with  the  old  separation  variable  set 
equal  to  zero.  In  addition,  as  we  work  our  way  back  up 
in  the  tree  it  is  a method  for  getting  the  best  possible 
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dual  variable  values  at  each  level  (since  we  always  use 
subgradient  optimization  at  a binary  branch)  and  does 
not  require  storing  dual  variables  at  every  level. 

One  problem  we  encountered  with  this  technique  is 
that  we  were  dealing  with  a large  number  of  subproblems 
of  very  small  dimension,  over  and  over  again  we  would 
reduce  the  problem  to  one  with  just  a few  variables  but 
we  could  not  fathom  the  node  with  either  pure  shortest 
path  or  subgradient  optimization  of  the  Lagrangean 
relaxation.  Chapter  5 will  deal  with  a new  "relaxation" 
which  we  developed  in  order  to  handle  this 
characteristic  of  the  problem. 


CHAPTER  5 


A ROLE  FOR  PRIBE  NDBBERS  IN  INTEGER  PROGRAHMING 


In  this  chapter  we  show  that  any  integer  program- 
ming problem  may  be  easily  reformulated  as  a knapsack 
problem  using  the  unique  properties  of  prime  numbers. 

Ve  show  that  the  same  techniques  may  be  used  to  linearly 
order  the  nodes  of  a search  tree  in  solving  an  integer 
program.  Such  a linear  ordering  can  be  used  to  reduce 
the  number  of  subproblems  that  need  to  be  evaluated.  In 
Section  5.5  we  develop  a new  algorithm  for  the  set 
partitioning  problem,  based  on  the  knapsack  representa- 
tion. In  the  next  section  this  algorithm  is  incor- 
porated into  the  HELSINKI  algorithm  to  fathom  small 
subproblems.  Nc  present  an  algorithm  for  the  set 
covering  problem  in  the  last  section. 
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5.1  fie  f or  Bu  la  ting  an  Integer  Programing  Problea  as 

a Single  Constraint  Integer  Prograaaing  Problea 


Consider  an  integer  linear  prograaaing  problea  (IP) 

N 


(IP) 


Hin 


S CjX. 

jol  J J 

N 

St  IS 

j-1  J i 


Xj  > 0,  Integer 


where  a and  b^  are  integers. 


X * 1/.../M 


j — 


It  is  known  that  (IP)  may  be  reformulated  as  a 
singly  constrained  problea  [Garfinkel  6 Neahauser 
(1972)  ]. 


The  technique  involves  considering  the  constraints 
two  at  a time.  A pair  of  multipliers  is  found  and  the 
weighted  constraints  are  combined  to  replace  the 
original  pair  with  a single  constraint.  This  trans- 
foraation  yields  an  integer  program  with  only  one 
constraint  (IP')* 


N 

(IP')  Min  53  C.X. 

J-1  ^ ^ 

N 

St  X/  ^ 

j = l ^ ^ 

Xj  > 0, Integer  j = 1,...,N 

5.2  A New  Method  for  Reformulating  an  Integer 
Programming  Problem  as  a Knapsack  Problem 

It  is  easy  to  show  that  there  exists  a set  of 
multipliers,  namely  the  logarithms  of  the  prime  numbers, 
which  may  be  used  to  combine  an  arbitrary  number  of 
constraints.  This  set  of  multipliers  can  be  used  to 
reformulate  any  integer  programming  problem  as  a 
knapsack  problem  (where  some  of  the  objective  function 
coefficients  may  be  negative) . 

Consider,  again,  the  general  integer  linear 
programming  problem  (IP).  Let  P i = 1,...,M  be  the 
first  M prime  numbers.  Then,  if  Xj  is  an  integer, 

j ~ 1,»..,N 
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<=> 


<=> 


N 

2 a X s b 
jTi  ^ 

M 

S ajjXj  - b, 


= 1 


N 


M 2ja. .X, 

Hp/-' 

1=1  ^ 


- b. 


= 1 


i = 


i = 


<=> 


M N 

2 (S 

1=1  j=i 


a .,X. 
iJ  J 


- \ )lnP^ 


= 0 


<=> 


N M 

2 (2  a InP  )X 

j=l  1=1  1 J 


M 


1=1 


‘’i 


where  going  fron  the  second  statement  to  the  third 
statement  depends  upon  the  unique  properties  of  the 
prime  numbers. 


► f 


Consequently,  any  integer  programming  problem  (IP) 
with  integer  coefficients  in  the  constraints,  can  be 
rewritten  as  a singly  constrained  integer  programming 
Problem  (K). 


N 

(K)  Bin  S C . X, 

j-1  ^ J 

N M M 

St  r ( S a.,  inp  ) X = S 

j=l  1=1  ^ J 1=1 

Xj  ^ 0,  Integer  j = 1,....,N 

He  refer  to  (K)  as  the  knifedge  problem  because  of 
three  characteristics  which  distinguish  it  from  the 
knapsack  problem.  The  first  characteristic  is  that  the 
constraint  coefficients  are  not  the  desired  small 
integers.  Rather,  they  are  irrational  numbers  and  any 
transformation  which  does  not  sacrifice  accuracy  would 
result  in  integer  coefficients  of  infinite  size.  The 
second  characteristic  is  that  the  constraint  is  an 
equation,  making  the  usual  linear  relaxation  (obtained 
by  taking  the  variables  in  order  of  their  ratios 

M 

Cj/  SaylnPj 

and  setting  the  fractional  variable  to  the  largest 

smaller  integer)  almost  useless..  Indeed  the  constraint 

defines  a fine  "edge"  with  no  width  to  it  at  all.  The  I 

third  characteristic  is  that,  quite  likely,  some  ^ 

coefficients  of  the  objective  function  or  the  constraint  > 

i 

■ 

i 
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■ay  be  negative.  Unlike  a normal  knapsack  problem,  if 
the  objective  and  constraint  coefficients  of  any 
variable  are  of  unlike  signs,  that  variable  cannot  be 
assigned  immediately  a value.  Ibis  is  due  to  the 
equation  constraint.  It  is  possible,  however,  to  modify 
(IP)  so  that  (K)  will  have  all  positive  constraint 
coefficients. 

To  (IP)  append  the  constraint 
N+1 

S X = B 

where  B is  a sufficiently  large  number  and  is  a slack 

variable  with  = 0.  We  have 

N+1 

(IP')  Min  S C,  X . 

j = l ^ ^ 

N+1 

St  Sa^^.X.  = i = 1,.«.,M 

j-1  ^ ^ 

N+1 

S X = B 
j-1  ^ 

Xj  ^0,  Integer  j = 
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Let  P be  a sufficienly  large  prime  nrmber  such 

that 


usicg  (IP'),  the  resulting  knifedge  problem  (K')  will 
have  nonnegative  constraint  coefficients.  In  practice 
it  will  usually  be  more  productive  to  achieve  the  same 
result  in  another  manner.  Suppose  the  constraint 


coefficient  of  X is  negative.  Then,  if  U is  an  upper 

J J 

bound  for  (in  practice  an  upper  bound  can  usually  be 


assumed) , X.  may  be  replaced  by  0 - X . This,  too, 

J J 1 

will  result  in  a knifedge  formulation  with  nonnegative 


constraint  coefficients. 


Consider  now  (K)  , the  Knifedge  interpretation  of  a 
general  integer  programming  problem..  For  ease  of 
notation  we  write  (K)  as 
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N 

(K)  Hio  S C. X . 

j-1  ■’ 

N 

St  2 X . = b 

j=l  ^ ^ 

Xj  > 0,  Integer  j = 


In  general  (K)  can  be  quite  ditficult  to  solve. 
Since  linear  programming  (and  other)  relaxations  tend  to 
be  very  weak  due  to  the  equality  constraint,  it  appears 
to  be  desirable  to  avoid  relaxation  procedures. 
Alternatively,  shortest  path  and  dynamic  programming 
offer  solution  techniques  which  avoid  this  problem. 
Assume  for  the  moment  that  the  coefficients  a^,  b of  (K) 
are  integer.  Then  (K)  may  be  interpreted  as  a shortest 
path  problem.  Let  G = (V,A)  he  a directed  graph  with 
vertex  set  V = (0 , 1, 2,. . . , b)  and  arc  set  A = {(i,k):k-i 
= for  some  j,  j = 1,...,N)  [ Garfinkel  & Nemhauser 
(1972)  ].  Then  solving  (K)  corresponds  to  finding  the 
shortest  path  in  G.  Note  that  if  a^  > 0,  j = 1,...,N, 
the  graph  G contains  no  cycles,  since  a^  > 0 for  all  j 
iaplies  that  if  (i,k)  is  in  A,  then  k > i.  He  will 
always  assume  a^  > 0 in  (K)  without  loss  of  generality. 
Then  (K)  may  be  represented  as  an  acyclic  shortest  path 
problea. 
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5.3  Muaerical  Considecations  - Finite 

Approximations  to  the  Logs  of  Primes 

In  section  5.2  we  have  assumed  that  the  coef- 
ficients a ,5  of  (K)  are  integer.  In  practice  this  is 
j 

reasonable  since  any  computer  implementation  will 
require  the  use  of  rational  numbers..  However,  our  proof 
in  Section  5.2  that  (K)  is  equivalent  to  (IP)  depended 
on  the  use  of  the  exact  logs  of  primes.  (K)  with 
irrational  coefficients  a^ , £ may  still  be  represented 
as  a shortest  path  problem. 


Let  G = (V,A)  be  a directed  grapn  with  vertex  set  V 

= {v;v  = a^Xj  and  vl  b where  S S {1,2,. . . ,N}  , Xj  i 0, 

integer}  and  arc  set  A = ((i,k)  :k  - i = a^  for  some  j = 

1,2,... ,N}.  Then  solving  (K)  corresponds  to  finding  the 

shortest  path  in  G.  Note  that  each  variable  in  (IP) 

corresponds  to  an  arc  xj  at  each  vertex  v..  Therefore, 

choosing  an  arc  X^  to  be  in  the  path  is  equivalent  to 

incrementing  by  one  the  value  of  the  variable  X..  Ue 

^ M ^ 

define  the  SPAN  of  an  arc  X to  be  a = S a InP  . 

J J * 

(The  natural  concept  here  is  one  of  length  - 
unfortunately  early  practitioners  with  shortest  path 
problems  gave  the  word  length  another  connotation. 
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aaaely  cost).  He  define  the  span  of  a path  to  be  the 
SUB  of  the  spans  of  the  arcs  making  up  that  path.  The 
span  of  a node  is  sioply  the  span  of  any  path  reaching 
that  node. 


In  practice  it  is  not  possible  to  use  the  exact 
logs  of  primes.  Are  we  then,  justified  in  solving  (K) 
in  place  of  (IP) ? Consider  the  following  reformulation 
of  (IP) . 


Let  P* 


= [ lO^lnP^  ] t integer 


where  [a]  = largest  integer  less  than  or  egual  to  a. 


“5  ' S “ij 


♦ P* 
Mfl 


b.  = S bj  PJ  ♦ 


Then 


j=l  J ^ 


a*  X , = b* 

3 j 


Xj  Z.  Integer 


j — 


is  a relaxation  of  (IP) . Moreover  it  is  a relaxation 
which  can  be  made  arbitrarily  close  to  (K)  in  the 
natural  sense  by  choosing  t large.  (Another  way  of 
tightening  the  relaxation  (K*)  would  be  to  use  surrogate 
duality  [Glover  (1968,  1975),  Greenberg  £ Piers)calla 
(1970),  Karwan  & Sardin  (1976)].) 

Proposition  5.1:  If  we  assume  X is  bounded  in  (K) , 
then  by  approximating  the  irrational  coefficients  a.,b 
of  (K)  closely  enough  by  rational  numbers  a^,b*;  (K) 

may  be  reformulated  as  an  equivalent  Icnapsack  problem 
with  rational  coefficients. 

Proof:  If  we  assume  the  variables  X are  bounded, 

X e S = {X:  0 < Xj  < U^,  Xj  Integer,  j = 1,...,N),  then  S 

is  finite.  Let  3'  = {X  e S:  aX  b) . For  any  X e S', 

(ax  - B)  ^0,  so  there  exists  a <5  > 0 and  a neighborhood 

Njj(a)  such  that  the  absolute  value  of  (aX  - b)  is  greater 

than  5 > 0 for  all  a c N^(a)  . Then  there  exists  a 

neighborhood  N^(6)  such  that  aX  ^ b for  all  a in  Nj^(a) 

and  all  b in  W„(6).  Let  H (a)  = n N„(a)  and 

xes' 

H (b)  = n N (b)  . Then  aX  ^ b for  all  a e N(a)  and 
^ xes’ 

b c N (b)  and  X e S'.  Consider  a set  of  rational 
coefficients  {(a’^,b’^)  e a^,b'^  rational,  j = 1,2,...,M 
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i 

» 

It 

k = 1,2,...)  converging  to  (a,b) . Then  all  but  a ^ 
finite  number  of  problems 

(p’^)  Min  CX 

St  x:  a’^X.  = b*^ 

j = 1 ^ ^ 

define  problems  equivalent  to  (K) . Consequently  by 
choosing  t large  enough  (but  finite),  (K*)  will  be 
equivalent  to  (K) . 

The  advantage  of  (K*)  is  that  the  size  of  the 
shortest  path  problem  (b*  nodes)  may  be  controlled  by 
the  user.  By  properly  choosing  t and  the  base  of  the 
logarithms,  b*  can  be  chosen  arbitrarily. 


By  choosing  t large  we  may  approximate  (IP)  or  (K) 
very  closely.  How  does  (K*)  differ  from  (K)  ? To  answer 
this  we  must  develop  a bit  of  machinery.  Consider  now 
the  shortest  path  interpretation  of  (K) , the  knifedge 
representation  of  (IP) . Choosing  an  arc  X^  to  be  in  the 
path  corresponds  to  using  a certain  amount  of  resources 
in  the  original  problem  (IP)  represented  by  the  vector 
He  shall  prove  in  Proposition  5.2,  that  a path  of 
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span  P corresponds  to  a unique  vector  of  resource 
utilization  in  (IP). 


Proposition  5.2:  Consider  an  integer  programming  problem 
(IP)  and  the  corresponding  knifedge  problem  (K)  with  its 
shortest  path  interpretation.  Consider  two  sets  of  arcs 
S = {X'fJt;k  = 1,2,.. .,k^  ) and  S2  = {Xh*':k  = 1,...,k2} 

1 Djj  k 

If  the  two  sets  of  arcs  have  equal  span, 

M *^2  M 

i-'-  E ,Z  = s E “ih  1”^ 

k=l  1=1  k k=l  1=1  k 


then  they  must  cover  precisely  the  same  rows  the  same 
number  of  times  each,  i.e.  they  must  correspond  to 
precisely  the  same  resource  vector. 


Proof: 


1 M 


E E ^ij  InPi 

k=l  i=l  ^k  ^ 


1 M 


n n pfi^ik 


k=l  i=l 


^ j 

n " 


f ,E  a,  UP, 

k=l  1=1  k 


2 M 

n n p'^^k 

k=l  i=l 


" 2 a.K 

n pf^  " 

J _ « * 


1 


r 


"■1 

S a. 


= > 


.k=l 


2 a 

k=l  ^''k 

i=1,...,H 


= > 


E 

k=l 


iJ, 


= 2 ®lh 

k=l  “'k 


i = 


(since  Integer) 


QED 


The  proof  of  Proposition  5.2  depends  upon  the  use 
of  exact  logs  of  primes,  consequently,  (K*)  is  only  an 
approximation  of  (K)  since  it  is  possible  for  a path  of 
span  P to  represent  more  than  one  vector  of  resource 
utilization.  It  is  the  case  that  tvo  paths  and  S2  in 
the  shortest  path  representation  of  (K*)  having  the  same 
resource  utilization 


i.e. 


M 

S S 


XjESj  1=1 


M 

s s 


a. 


XjeS2  i=l 


will  have  the  same  span 


M 


i.e. 


'pi 


M 


XTeS,  1-1 


= S S a P* 

XjeS2  1-1  ^ 
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where  (K*)  fails  is  that  it  may  find  an  optimal 
solution  S that  is  not  feasible  in  (IP).  That  is,  the 
utilization  rector  of  the  path  corresponding  to  S may 
not  be  equal  to  b,  the  right  hand  side  vector  in  the 
constraints  of  the  original  integer  programming  problem. 
This  is  because,  due  to  approximating  the  logs  of 
primes,  two  paths  of  equal  span  may  not  correspond  to 
the  same  utilization  of  resources  vector.  We  regard 
(K*)  as  having  a resolution  problem.  • With  t very  small 
(say  3)  the  nodes  of  the  shortest  path  representation 
tend  to  become  blurred  together  and  the  algorithm  may 
perceive  two  actually  distinct  nodes  as  being  one  and 
the  same  node.  Increasing  t increases  the  algorithm's 
power  of  resolution. 

This  suggests  a method  for  solving  the  relaxation 
(K*)  in  a manner  that  ensures  it  will  be  an  equivalent 
problem  to  (IP) . Suppose  (K*)  is  being  solved  as  a 
shortest  path  problem  by  a forward  reaching  algorithm 
(e.  g.  Algorithm  3.2).  At  a node  (with  span)  j we  check 
each  arc  originating  at  j.  Let  the  first  arc  be  i. 

Then  arc  i plus  the  path  to  j form  a path  to  (j  span 
of  arc  i)  , (say  k) . If  node  k has  a label  less  than 
infinity,  we  now  require  that  the  algorithm  check  that 
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the  incumbent  path  to  k and  the  new  path  (to  j then  k 
via  i)  have  the  same  resource  utilization  vector.  If 
they  do,  we  proceed  as  usual.  If  they  do  not,  then  we 
arbitrarily  give  the  new  path  (to  j then  k via  i) 
span  k 4 1 and  begin  again  with  arc  i.  Provided 

t is  reasonably  large,  this  method  should  resolve  all 
but  the  most  pathological  cases.  In  practice  we  have 
used  t = 11  or  13  and  have  not  used  the  above  refinement 
to  our  algorithm  as  its  computational  burden  is  very 
high. 

It  is  also  possible  to  interpret  (K*)  from  a 
geometric  point  of  view.  In  the  knifedge  problem  (K) 
the  constraint 

N 

S a X = B 

j“l  ^ J 

represents  a hyperplane  H in  R^.  In  all  of  the  only 
integer  points  in  H are  those  that  are  feasible  in  (IP). 
And  H Includes  all  (IP)  feasible  integer  points.  The 
constraint 

S a*X,  = b* 

j-1  i J 

in  (K*)  may  be  seen  as  a hyperplane  H*  approximating  H. 
H*  is  an  approximation  in  that  it  still  contains  all 
(IP)  feasible  integer  points  but  may  contain  additional 
integer  points  as  well.  Consequently,  it  is  a tilted 
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YeL'sioii  of  H,  Kheri:  n*  l»af>  u totaled  slicjhtly  about 
the  axi^;  which  i;;  the  altino  :;ubaet  of  defined  by  l he 
constraints  of  (IP) . Since  H is  a finite  distance  from 
all  (IP)  infeasible  iuteget  (ioints  it  may  be  rotated  a 
finite  amount  about  this  axis  without  including  any 
non-fcasihle  integer  points.  This  corresponds  exactly 
to  choosing  t large  enough  that  II*  rotates  little  enough 
to  include  no  non- feasil)ie  intCMjer  points. 


5.4  Induced  Linear  Orderings  in  Cembina* orial 
Problems 


The  transformation  whicit  took  us  iiom  (IP)  to  (K) 

■ay  be  looked  at  as  one  that  induces  a linear  otdering 

on  the  sot  of  partial  solutioji:  {X;X  - {X  , . . . , Xj^}  , 

k < N).  n ore  precis»;ly,  it  may  be  scrji  as  inducing  a 

M 

linear  on<;-to-one  mapping  from  the  vectors  A ^ in  Z in 
resource  space  to  tlio  real  numbers. 


P:  2 ->  R 


j » * 


r..  InP 


P(Aj  ) 


a, ^ InP. 
i-1  ^ 


r 


Consider  a partial  solution  X to  (IP)  # X = {Xj  ,X2» 
where  for  ease  of  notation  the  variables  X^ 
have  been  reordered  so  that  the  variables  with  assigned 
values  (i. e.  in  the  partial  solution)  are  j = 

Then  % uses  a certain  amount  of  resources, 

namely 

k 

j=l 

or  equivalently, 
k 

2 A.  X 

^ 3 i 

j=l  ^ 

Consequently,  we  can  extend  the  mapping  P to  map 
(partial)  solutions  of  (IP)  into  the  real  numbers. 

k 


P(X)  = P(  S AX) 
j=l  J ^ 

k 


r 


1 


I 


;j 


This  extended  oapping,  P,  is  not  necessarily 
one-to-one  since  two  different  partial  solutions  nay 
have  the  same  resource  utilization  vector.  It  is  this 
fact  which  may  be  exploited. 

Consider  a decision  tree  as  normally  developed  in 
branch  and  bound  procedures  for  integer  programming. 

See  Figure  5.1.  For  example  branching  may  be  binary  (X^ 
= 0 and  X^  = 1)  for  0-1  integer  programming  problems.  It 
is  characteristic  of  branch  and  bound  procedures  that 
each  node  k corresponds  to  a partial  solution  to  (IP)  . 
Consequently,  our  extended  mapping  can  be  considered  as 
mapping  each  node  to  the  real  numbers  (P  (kj } . In  other 
words  we  are  able  to  induce  a linear  ordering  on  the 
nodes  of  the  search  tree.  See  Figure  5.2. 

It  is  very  often  the  case  in  branch  and  bound  that 
identical  subtrees  are  generated  over  and  over  again. 
This  occurs  when  two  nodes  (or  partial  solutions) , 
neither  one  descended  from  the  other,  have  the  same 
resource  utilization  vectors 

i-e.  i 

X^eX 
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and  the  same  set  of  unassigned  variables. 


This  duplication  of  effort  is  highly  inefficient. 
However,  attempts  to  avoid  it  [Harsten  & Horin  (1978)  ] 
have  required  saving  an  H dimensional  resource 
utilization  vector  for  each  node  generated  using  a new 
combination  of  resources.  The  mapping  P maXes  it 
possible  to  achieve  the  same  end  by  storing  a single 
number  for  each  node  generated.  Consequently,  the  best 
properties  of  branch  and  bound  techniques  and  ox 
shortest  path  methods  may  be  combined  into  a single 
procedure. 


Consider  a shortest  path  problem  derived  from  the 
Xnifedge  interpretation  of  a 0-1  integer  programming 
problem.  The  advantage  of  using  a shortest  path 
algorithm  (for  example  Algorithm  3.2)  is  that  it  tends 
to  minimize  duplicated  efforts.  The  reason  for  this  is 
that  only  a single  path  is  generated  from  the  origin  to 
each  node.  Consider  the  case  in  Figure  5.3.  (Note  that 
each  variable  in  the  original  problem  corresponds  to 
an  arc  ^ j ^t  each  node  v. ) There  are  two  distinct  paths 
to  the  fourth  node,  (X°)  and  (X^  xj).  In  the  shortest 
path  algorithm  the  path  from  node  4 to  the  end  is 
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considered  only  once.  But  in  a binary  search  tree  where 


Xj  is  the  separation  variable  at  level  j (see  Figure 
5.4),  the  path  from  node  4 (of  the  shortest  path 
representation)  to  the  end  would  have  to  be  considered 
twice  (from  nodes  8 and  13  in  Figure  5..4}. 


If  we  were  to  use  the  mapping  P,  the  decision  tree 
of  Figure  5.4  could  be  redrawn  as  in  Figure  5.5,  where 
the  height  of  a node  in  the  decision  tree  indicates  its 
position  in  the  linear  ordering  induced  by  P.  Nodes 
having  the  same  height  can  be  recognized  as  having  the 
same  resource  utilization  vector.  Nodes  at  the  same 
height  and  the  same  level  are  equivalent  (i.e.  they 
have  the  same  resource  utilization  and  same  set  of 
assigned  variables).  The  subsequent  development  for 
them  need  only  be  done  once. 


The  advantage  of  the  branch  and  bound  procedure  is 
that  we  are  likely  to  find  a feasible  solution  early  on 
in  the  search.  We  can  then  use  this  incumbent  solution 
to  bound  other  nodes.  Host  shortest  path  algorithms,  on 
the  other  hand,  do  not  generate  a feasible  solution 
until  the  procedure  is  very  near  to  its  conclusion, 
(fethods  have  been  devised  to  eliminate  this  shortcoming 
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Figure  5.4 


I 

1 


'i 


of  shortest  path  algorithas  [Shapiro  & Wagner  (1967), 
Glover  (1967),  Shapiro  (1968)  ].  For  example,  if 
Glover's  algorithm  is  interpreted  in  a shortest  path 
context  (it  is  developed  as  a dynamic  programming 
recursion  technique)  it  can  be  seen  to  be  equivalent  to 
a shortest  path  algorithm  that  bounds  any  path  which 
becomes  longer  than  the  shortest  path^  With  the  napping 
P it  is  possible  to  develop  an  algorithm  equivalent  to 
Glover's  which  will  work  on  any  integer  programming 
problem. 


5.5  Applications  to  the  Set  Partitioning  Problem 

In  Section  5.4  we  have  indicated  how  the  use  of 
prime  numbers  might  expedite  branch  and  bound  procedures 
for  general  integer  programming  problems.  For  the  set 
partitioning  problem,  because  of  its  special  properties, 
ve  can  do  much  better. 

The  first  property  of  set  partitioning  problems  is 
that  the  variables  nay  be  constrained  to  be  0 or  1.  In 
practice,  though,  it  is  not  necessary  to  consider  the 
upper  bounds  since  the  constraints  themselves  do  not 
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allow  any  variable  to  exceed  one.  It  is  true  for  all 
0-1  integer  programming  problems  that  partial  solutions 
correspond  to  subsets  of  where  X corresponds 


1) . Then  if  the  set  partitioning  problem 


0,1,  the  Jenifedge  representation  (KSP)  is 


As  shown  in  Section  5.3  (KSP)  may  be  interpreted  as 
a problem  of  finding  a shortest  path  on  a graph  G » (V,A) 


where  V = {v;v  = ^ 


a.  .IqP.  )X.  where 
i=l  J 


S C {1,2,...,N}  and  X.  = 0,1) 


i.e.  where  V = {v:v  = 


S ^ {1 « • ^ # N)  } ,« 


iLrf  ( ^ ) where 


jeS  i=l 


Conseguently  there  are  at  aost  2"  nodes  or  vertices 
on  the  Graph  G. 


Another  characteristic  is  that  all  of  the  coef- 
ficients of  the  constraints  are  either  0 or  1.  The 
right  hand  side  is  all  ones.  Consequently,  the  only 
nodes  which  need  be  considered  are 


= {v;v  = 


InPi  where  T ^ {1,....  ,H}} 


This  is  because  these  are  the  values  of  v which 

correspond  via  the  one-to-one  napping  P to  feasible 
. . M 

resource  utilizations  in  Z . Consequently,  there  are  at 
H M 

Bost  Nin  (2  ,2  ) nodes  or  vertices  on  the  graph  G. 


s 


Be  can  sake  farther  use  of  the  logical  ioplications 
of  the  set  partitioning  problem.  Ne  begin  by  reform" 
alating  the  Principle  of  Optimality  in  terns  of 
shortest  path  problems. 


[ 0 

*1  '*2  ■'k 

in  the  shortest  path  representation  of  (KSP)  , the 


Let  X = {X^^  ,xX2  be  an  ordered  optimal  path 

J 1 Jo  Ju 


knifedge  reformulation  of  the  set  partitioning  problem 

(SP).  The  unique,  so  we  suppress  the 

superscripts  v with  the  understanding  that  each  arc  X. 

i _ 

begins  where  the  previous  arc  Xi  ends.  Then  X = 


{X  , . . . X^  } 


. The  set  X*  = {X  :X  = 1 for  all  X e X, 


Xj  B 0 otherwise}  is  an  optimal  solution  to  (KSP)  and 

(SP) . A SUBPATH  X^  of  X is  the  ordered  subset  of  the 

first  P elements  of  X,  X = {X.  ,X.  ,«..,X.  },  p k. 

Note  the  elements  of  Ic  must  retain  the  order  they  have 

P 

in  1.  2p  corresponds  to  a partial  solution  X*  of  (SP) 

where  X*  = {X,  :X,  = 1 for  all  X e X„,  X undetermined 
P j j i ^ i 

otherwise) . 


Principle  of  Optimality:  An  ordered  optimal  path  X s 


(X 


,X.  ) to  a shortest  path  problem  is  said  to 
^k 


satisfy  the  Principle  of  Optimality  if  and  only  if  any 

ordered  subpath  X = (X.  ,...,X,  ],  p <,  k,  has  the 

P Ji  Jp 
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• property  that  it  is  an  optimal  path  for  the  nodes  it 

covers. 

I 

' Notice  that  some  ordering  of  {X.  ,X.  ,...,X.  } is  a 

solution  to  the  shortest  path  problem  which  can  be 
. generated  by  usual  techniques  (e.g.  Algorithm  3.2). 

However,  in  general  it  is  not  the  case  that  an  arbitrary 
' permutation  of  {X.  ,X.  ,...,X.  ) will  satisfy  the 

! principal  of  optimality.  Consider  the  problem 


Min 

^ 

X2  ♦ 

3X3 

St 

Xi  + 

2X2  + 

3X3  = 4 

= 

0,1 

j = 1,2,3 

The  optimal  solution  is  Xj^  = X^  = 1,  X2  = 0.  The 
corresponding  optimal  path  to  the  shortest  path 
interpretation  is  (X^^rX^).  This  permutation  satisfies 
the  principle  of  optimality.  However,  the  permutation 
(X^rXj^)  does  not  since  (X^)  is  not  an  optimal  subpath 
((Xj^,X2)  is  a better  subpath).  Consequently,  it  is  not 
the  case  that  an  arbitrary  permutation  can  be  generated 
by  the  usual  shortest  path  algorithms.  However,  it  is 
the  case  for  set  partitioning  problems  that  any 
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permutation  of  an  optimal  answer  will  satisfy  the 
principle  of  optimality. 


Proposition  5.3:  For  the  set  partitioning  problem  (SP) 
and  the  corresponding  knifedge  problem  (KSP) , any 
permutation  of  an  optimal  answer  will  satisfy  the 
principle  of  optimality  with  respect  to  the  shortest 
path  interpretation  of  (KSP)  . 

Proof:  Consider  a permutation  X = {X.  ,X . ,...,X. 
of  an  optimal  solution  to  (KS?) . Consider  a subpath 
X = {X.  ,X.  ,...,X.  )/  p £ k,  with  span  P.  Suppose 
Xp  is  not  an  optimal  path  with  span  P,  but  rather 

= {Yj^  ,Y2  ) is.  Then  by  Proposition  5.2 

and  the  fact  that  no  row  of  the  original  set  parti- 
tioning problem  can  be  covered  more  than  once,  we  have, 
Y^.  n (X  \Xp)  = 0.  But  then  Y^  U (X  \X^)  is  a better 
solution  than  X to  the  original  problem,  which 
cannot  be  the  case. 


QED 
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Proposition  5.3  is  very  powerful.  It  allows  us,  in 
advance,  to  order  the  optimal  solution  in  any  fashion  we 
choose.  We  choose  to  order  the  arcs  of  the  solution 
according  to  the  row  containing  their  first  entry.  Then 
at  any  node  of  our  shortest  path  algorithm  we  need 
consider  only  those  arcs  which  begin  at  the  first 
uncovered  row.  And  of  those  arcs  we  need  consider  only 
those  that  do  not  conflict  with  the  shortest  path  to  the 
node  being  considered.  This  drastically  reduces  the 
number  of  nodes  generated. 

Consider  the  following  algorithm  to  solve  a set 
partitioning  problem  (SP)  . 

Algorithm  5.1: 

STEP  1 From  (SP)  form  the  corresponding  knifedge 
problem  (KSP) . Interpret  (KSP)  as  a shortest  path 
problem  where  the  number  of  nodes  is 

M 

y Inp. 
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] where  P.  is 

1 V . 

] arc  IS 

‘I' 


the  ith  prime  number. 


M 

E 


i=l 


a . . InP . . 


and  the  span  of  each 


I 


STEP  2 Solve  the  shortest  path  problem  generating 
nodes  as  needed.  A node  is  characterized  by  its  span 
from  the  origin,  which  is  given  by  the  sum  of  the  spans 
of  the  arcs  in  a path  reaching  it.  At  a given  node 
consider  only  the  arcs  which  begin  in  the  first  row  of 
(SP)  not  yet  covered,  eliminate  those  which  conflict 
with  the  path  to  date. 

We  tried  implementing  Algorithm  5.1  and  found  that 
it  was  very  sensitive  to  problem  size.  Small  set 
partitioning  problems  were  solved  with  great  speed.  A 
test  problem  of  30  rows  and  168  columns  was  solved  in 
less  than  two  seconds  of  execution  tine  on  a CYBEE  175, 
generating  only  2314  nodes.  However,  the  same  problem 
with  ten  additional  rows  and  103  additional  columns 
generated  more  than  20,000  nodes  (in  7 seconds)  and  was 
abandoned  (it  was  approximately  one*half  of  the  way 
through  the  problem) . 


i 
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5.6  Applications  to  the  Helsinki  Problem 


r 

I, 


In  conjunction  with  the  HELSINKI  algorithm. 
Algorithm  5. 1 is  very  powerful.  The  iterative 
backward- forward  shortest  path  Algorithm  3.3  plus 
subgradient  optimization  allows  us  to  eliminate 
variables  at  each  node  vihich  cannot  be  eliminated  by 
logical  considerations  alone.  Our  experience  has  been 
that  75^  of  the  variables  can  be  eliminated  at  the  first 
level  of  the  decision  tree  (i.e.  when  only  one  variable 
has  been  chosen  to  be  in  the  solution).  This  makes 
Algorithm  5.1  very  attractive  as  a "relaxation”  in  the 
two  duty  period  scheduling  problem.  Our  practice  has 
been  to  use  Algorithm  5.1  when  the  number  of  variables 
remaining  in  the  problem  falls  below  some  number 
(between  60  and  120).  Algorithm  5.1  is  very  useful  in 
cleaning  up  small  problems  which  very  often  require  a 
significant  number  of  branches  otherwise.  Algorithm  5.  1 
may  also  be  a good  relaxation  to  incorporate  into 
implicit  enumeration  algorithms  for  solving  general  set 
partitioning  problems. 

An  additional  advantage  of  the  Knifedge  relaxation 
is  that  it  makes  it  very  easy  to  handle  certain  side 
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N 

E Xj  < K 
j = l ^ 

This  may  be  done  by  adding  a slack  to  get 


N 

E Xi  ♦ S = K 

j = l J 

Bultiplying  the  constraint  by  lnP„^,  and  adding  it  to 
the  Knifedge  constraint  for  (IP).  Then,  however,  the 
arc  S must  be  considered  at  every  node  and  the  path  is 
signif iciantly  longer  (by  KlnPj^^.j^)  . There  is  a way  in 
which  this  constraint  can  be  implemented  logically  so 
that  the  path  is  shorter  rather  than  longer.  At  each 
node  Algorithm  5. 1 checks  to  see  if  a feasible  solution 
has  been  found  by  finding  the  first  row  not  covered. 
Consequently,  whenever  a feasible  solution  is  found  the 
path  ends  there  and  the  solution  is  recorded  if  it  is 
the  best  so  far.  If  the  current  node  is  not  a feasible 
solution,  that  is  there  is  a row  < K not  yet  covered, 
then  the  number  of  arcs  in  the  path  to  this  node  is 
checked.  If  it  is  equal  to  K,  the  node  is  fathomed;  In 
this  way  many  more  nodes  are  fathomed  along  the  way  and 
the  arc  S is  never  considered. 
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Ve  have  used  the  Knifedge  relaxation  in  the  two 
duty  period  scheduling  problem  and  have  found  that  it 
significantly  reduces  the  running  time  (5  - 30%) . In 
addition ’it  gives  the  capability  of  more  easily 
considering  side  constraints. 


5.7  Applications  to  the  Set  Covering  Problem 

An  algorithm  similar  to  Algorithm  5.1  can  be 
developed  for  the  set  covering  problem  (SC) 

(SC)  Hin  CX 

St  AX  > 1 

X = 0,  1 

where  a^j  = 0,1;  C > 0. 

Consider  the  shortest  path  interpretation  of  the 
knifedge  formulation  (KSC)  of  (SC) . Ue  show  that  any 
permutation  of  an  optimal  solution  to  (SC)  will  satisfy 
the  principle  of  optimality. 
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Proposition  5. 4;  For  the  set  covering  problem  (SC) 
and  the  corresponding  knifedge  problem  (KSC) , any 
permutation  of  an  optimal  answer  will  satisfy  the 
Principle  of  Optimality  with  respect  to  the  shortest 
path  interpretation  of  (KSC). 

Proof:  Consider  a permutation  X = (X.  ,X.  ,...,X.  ) 
of  an  optimal  solution  to  (KSC).  Consider  a subpath 
X = (X.  ,X.  ,...,X.  ),  p^  k,  with  span  P.  Suppose 
Xp  is  not  an  optimal  path  with  span  P,  but  rather 

Yt  = (^1  •'^2  ^ ^ (X\Xp). 

Then  X^  covers  rows  of  (SC)  already  covered  by  X^  by 
Proposition  5.2.  Then  (X\X^)  is  a solution  to  (SC)  better 
than  X,  which  cannot  be.  Therefore,  n (X\Xp)  = ji 

which  implies  U (X\Xp)  is  a solution  to  (SC)  better 
than  X,  which  cannot  be. 


QED 

The  following  algorithm  will  solve  the  set  covering 
problem. 
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Algorithm  5.2: 


STEP  1 From  (SC)  form  the  corresponding  knifedge 
problem  (KSC) . Interpret  (KSC)  as  a shortest  path 
problem  where  the  number  of  nodes  is 


M 

E 

i = l 


InP 


where  P.  is  the  ith  prime  number,  and  the  span  of  each 

„ V 

arc  X . IS 
3 

M 

a . .InP . . 

J- 3 1 


E 

1 1 


STEP  2 Solve  the  shortest  path  problem  generating 
nodes  as  needed.  A node  is  characterized  by  its  span, 
which  is  given  by  the  sum  of  the  spans  of  the  arcs  in 
a path  reaching  it.  At  a given  node  consider  only  the 
arcs  which  cover  the  first  row  of  (SC)  not  yet  covered. 

We  show  that  Propositions  5.3  and  5.4  are  special 
cases  of  a more  general  result. 


« 


i 
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Proposition  5.5:  For  the  integer  prograoming  problem 
(IP)  with  no  upper  bounds  on  the  variables  and  the 
corresponding  knifedge  problem  (KIP) , any  permutation  of 
an  optimal  answer  will  satisfy  the  principle  of 
optimality  with  respect  to  the  shortest  path 
interpretation  of  (KIP)  . 

Proof:  Consider  a pernutation  X = (xTl , . . . , xY*^) 


of  an  optimal  solution  to  (KIP)  . Consider  a subpath 

= (xTi,...,x^i 
P 3 1 3, 


X = (xT^, . . . , X^P) , P ^ k,  with  span  P.  Suppose  X 


1 'P 

is  not  an  optimal  path  with  span  P,  but  rather 

= (y"1,. .. ,Y“t)  is.  Then  Y^  U (X\X  ) is  a 
t ' h^'  t P 

solution  to  (KIF)  better  than  X,  which  cannot  be. 


QED 


Note  that  the  proof  depends  upon  the  variables 
having  no  upper  bounds,  otherwise  Y^  U (X\Xp)  may 
violate  an  upper  bound  on  some  X^.  The  result  does  not 
hold  generally  for  problems  where  the  variables  are 
restricted  to  be  0,1.  However,  as  we  have  already  seen, 
in  the  set  partitioning  problem  the  variables  need  not 
be  constrained  to  be  0,1.  Similarly,  for  the  set 
covering  problem  with  C > 0,  the  variables  need  not  be 
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bounded  above.  Consequently,  Propositions  5.3  and  5.4 
follow  immediately  from  Proposition  5.5. 

Similarly,  for  any  0,1  integer  programming  problem 
where  the  constraints  force  the  0,1  condition. 
Proposition  5.5  holds.  An  example  would  be  a problem 
where  each  0,1  variable  is  included  in  a multiple  choice 
constraint  of  the  form 


S X,  = 1 

j cS 


where  S C {1,...,N). 


CHAPTER  6 


THE  GEIIEEAL  TWO  DUTY  PERIOD  SCHEDULING  PROBLEM 


The  gteneral  two  duty  period  scheduling  problem  (P) 
is  given  by 


Min  CX 


St  AX  = b 


X Integer,  Monnegative 

where  each  column  of  A contains  at  most  two  segments  of 
ones,  the  rest  of  the  entries  being  zero. 


As  developed  in  Chapter  2,  (P)  may  be  reformulated 


as  (p*) 


c’^Y  ♦ C^Z 


(Y,Z)  in  S 
lY  - 12  = 0 

Y,Z  Integer,  Nonnegative 
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where  S is  the  set  of  solutions  to  a network  flow 


problem. 

Much  of  the  methodology  developed  ir.  chapters  three 
and  four  may  be  simply  modified  to  fit  the  more  general 
two  duty  period  scheduling  problem.  The  relaxation  now 
is  a network  flow  problem  rather  than  the  shortest  path 
problem. 

For  the  same  reasons  that  the  shortest  path  was 
acyclic,  the  network  flow  problem  will  also  be  acyclic. 
Consequently,  we  still  have  a reasonably  easy  relaxation 
to  solve.  However,  in  the  case  of  greater  than  or  equal 
to  constraints  the  network  flow  problem  will  no  longer 
be  acyclic. 

The  major  drawback  with  the  general  two  duty  period 
scheduling  problem  is  that  we  are  not  able  to  make  such 
extensive  use  of  logical  reduction.  Much  of  the  power 
of  the  HELSINKI  algorithm  came  from  the  fact  that  once 
an  arc  was  chosen  to  be  in  the  shortest  path,  we  were 
able  to  eliminate  all  arcs  which  conflicted  with  it.  In 
the  case  of  the  network  problem  this  is  not  true. 
However,  some  logical  reduction  may  still  be  useful. 


i 

\ 


i. 

ll 

i 
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Consider  the  case  where  a^j  L 0 for  all  elements  a^^  of 
Aj  (i.e.  there  are  no  surplus  variables).  Suppose  we 
have  chosen  enough  arc  flows  (Xj , j in  J)  in  the  network 
flow  problem  such  that  the  ith  constraint  is  satisfied, 
i.  e. 


N 

E a x = b 
j-1  ^ 


Then  we  can  eliminate  (set  equal  to  zero)  all  other  Aj's 
(j  not  in  J)  such  that  a^j  > 0.  And  of  course  we  are 
still  able  to  eliminate  the  partner  of  any  eliminated 


Ve  can  also  use  subgradient  optimization  in  a 
completely  analogous  way.  ‘He  start  by  solving  the 
network  relaxation.  Then  by  the  use  of  subgradient 
optimization  we  reallocate  the  cost  of  a variable 
between  its  two  decoupled  arcs  in  such  a way  as  to  make 
them  more  nearly  equally  desirable.  Ibis  is  done  by 
increasing  the  costs  of  those  arcs  which  were  included 
in  the  network  solution  but  whose  partners  were  not 
included.  Simultaneously,  we  reduce  the  costs  of  the 
unincluded  partners  by  the  sane  amount.. 
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One  thing  that  will  have  to  be  done  differently 
with  the  general  two  duty  period  scheduling  problen  is 
the  branching  procedure  in  the  tree  search.  Rather  than 


using  as  a separation  variable  the  first  arc  in  the 
shortest  path  relaxation  solution  not  already  used  as  a 
separation  variable,  we  will  have  tc  choose  a branching 
procedure  adapted  to  the  network  relaxation. 
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CHAPTER  7 


COHPUTATIONAL  EXPERIENCE 

_ 

i 

In  this  chapter  we  present  our  computational 
experience  with  the  HELSINKI  algorithm.  In  Section  7.1 
we  discuss  the  test  problems  used.  Ue  follow  this  with 

results  on  the  logical  elimination  of  variables  and  ^ 

results  of  the  subgradient  optimization  procedure.  In 
Sections  7.4  and  7.5  we  present  computation  times  for 
the  test  problems  with  and  without  a side  constraint. 

Decision  trees  for  the  two  smaller  test  problems  are  j 

given  in  Section  7.7.  Section  7.8  provides  some  results  j 

of  the  prime  number  - shortest  path  algorithm  for  small  ! 

j 

test  problems.  ; 

1 

J 

7.1  The  Test  Problems 

Our  computational  experience  has  dealt  with  the 
Helsinki  problem  discussed  in  Chapter  3: 

■ . j 
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(HP)  Min  CX 

St  AX  = 1 

X = 0,1 

where  each  column  of  A contains  at  most  two  segments  of 
ones,  the  rest  of  the  entries  being  zero.  (HP)  is  a two 
duty  period  scheduling  problem.  It  is  also  a set 
partitioning  problem  and,  conseguently , good  methods 
already  exist  for  solving  it.  Our  intention  was  to  test 
our  method  for  solving  the  two  duty  period  scheduling 
problem  by  considering  first  the  Helsinki  problem.  If 
we  were  able  to  obtain  results  comparable  to  the  best 
existing  set  partitioning  algorithms,  then  Algorithm  2.1 
should  provide  a reasonable  method  for  solving  the  more 
general  two  duty  period  scheduling  problem 

(P)  Min  CX 

st  AX  = b 

X Integer,  Ncnnegative 

for  which  no  good  solution  techniques  exist. 

The  test  problems  used  in  evaluating  our  HELSINKI 
algorithm  were  sent  to  us  by  Markku  Tamminen  of  the 
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Helsinki  City  Transport.  The  four  test  problems  have 
roughly  the  same  size  but  varying  densities.  See  Table 

7.1. 


Problem  1 was  the  original  Helsinki  test  problem 
and  has  been  studied  rather  thoroughly  by  us  and  others. 
Tamminen  tested  a number  of  set  partitioning  algorithms 
on  problem  1 [ Koljonen  6 Tamminen  (1977)  ].  These 
included  the  Garf inkel-Nemhauser  algorithm  and  Marsten's 
set  partitioning  algorithm,  SETPAR.  In  addition 
Tamminen  tested  numerous  variations  of  the  Garfinkel- 
Nemhauser  algorithm,  with  and  without  linear  programming 
and  taking  advantage  of  the  special  two  segment 
structure  of  the  problem.  His  conclusion  was  that 
SETPAR  outperformed  any  other  method  they  could  devise. 
Throughout  this  chapter  our  results  with  the  HELSINKI 
algorithm  will  be  compared  with  SETPAR,  it  being  the 
best  known  existing  algorithm  for  solving  the  Helsinki 
problem. 
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Table  7.1 


7.2 


Effectiveness  of  Logical  Reduction  in  the 
HELSINKI  Algorithm 


Our  experience  is  that  both  of  these  features  are 
instrumental  in  significantly  reducing  the  problem  size. 
Table  7.2  indicates  the  reductions  we  have  experienced. 


One  of  the  main  features  of  the  HELSINKI  algorithm 
is  its  reliance  on  logic  to  reduce  the  problem  size. 
This  logical  reduction  entails  two  features.  The  first 
is  to  eliminate  any  variables  which  conflict  with 
variables  already  chosen  to  be  in  the  solution.  The 
second  is  to  eliminate  any  variables  which  can  not 
participate  in  a solution  improving  upon  the  best  known 
solution  (the  incumbent) . 


Table  7.2  does  not  accurately  reflect  the  number  of 
variables  which  could  be  eliminated  at  each  level.  As 
soon  as  a branch  was  bounded  or  found  to  be  infeasible, 
no  more  variables  were  eliminated.  In  those  cases  one 
hundred  per  cent  of  the  variables  could  have  been 
eliminated. 
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kt  level  one  ve  have  recorded  the  percentage  of 
variables  for  two  cases.  The  first  case,  1a,  is  the 
first  branch  at  level  1 (when  there  was  no  incumbent 
solution)  and  the  second  case,  1b,  is  for  subsequent 
branches  (when  there  was  an  incumbent  solution).  The 
difference  in  these  two  percentages  indicates  the 
additional  power  gained  from  the  elimination  of 
variables  whic'i  cannot  take  part  in  a solution  better 
than  the  incumbent.  This  elimination  was  achieved  by 
the  forward-backward  iterative  solution  process  for  the 
shortest  path  problem  as  described  in  Chapter  3, 
Algorithm  3.3. 

Note  that,  for  problems  1 and  4,  fewer  variables 
are  eliminated  at  level  4 than  at  level  3.  This 
apparent  anomaly  is  due  to  the  fact  that  level  4 was 
visited  only  when  there  was  no  incumbent  solution  or 
only  a weak  incumbent  (see  Figures  7.1  and  7.2). 


7.3  Results  of  the  Subgradient  Optimization  Procedure 

We  found  the  subgradient  optimization  procedure  to 
be  very  successful  in  tightening  our  relaxation.  Recall 
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that  the  Lagrangean  relaxation  of  the  Helsinki  problem 
has  the  integrality  property.  Therefore,  we  know  that 
the  subgradient  optimization  procedure  can  be  expected 
to  converge  to  the  linear  programming  relaxation 
solution.  In  practice  we  were  able  to  approximate  the 
linear  programming  solution  as  closely  as  we  wished  by 
simply  increasing  the  number  of  iterations  in  the 
subgradient  optimization  procedure.  The  most  difficult 
aspect  was  determining  the  best  number  of  iterations. 

We  finally  settled  on  a number  of  iterations  (about  600 
at  level  0)  which  resulted  in  a relaxation  value  within 
5%  of  the  linear  programming  value  (see  Table  7.  1) . We 
found  that  advantages  gained  by  tightening  the  relaxa- 
tion even  further  were  more  than  offset  by  increased 
computation  times  in  performing  the  additional  iterations 
needed . 


As  outlined  in  Chapter  4,  at  each  iteration  of  the 
subgradient  optimization  we  determine  a new  set  of  costs 
for  the  decoupled  arcs  and  then  resolve  the  shortest 
path  relaxation.  The  costs  at  iteration  t + 1 are 

computed  by  the  following  formula 


1 


■j 

i 
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where 


I 

j 


« 1 if  arc  j is  included  in  the  current 

solution  and  its  partner  is  not 
= -1  if  arc  j is  not  included  in  the 

current  solution  and  its  partner  is 
= 0 otherwise 


was  determined  according  to  the  formula 
= d*^  [W  - ]/  |s*^|^ 


There  are  two  degrees  of  freedom  in  determining  P*^. 
The  first  is  the  choice  of  the  multiplier  d^,  and  the 

Ak 

second  is  the  choice  of  the  target  value  H.  He  have 
tried  a number  of  procedures  for  choosing  the  target 
value.  Our  final  algorithm  incorporates  many  of  them. 


Be  have  found  that  at  the  very  top  of  the  decision 
tree,  before  any  subgradient  optimization  has  been  done, 
the  shortest  path  relaxation  is  very  weak.  Character- 
istically, it  has  been  on  the  order  of  twenty-five  per- 
cent of  the  linear  programming  value  and  twenty-two  per- 
cent of  the  optimal  value.  If  we  were  to  use  the  optimal 
answer  itself  as  the  target,  the  perturbation  term 


i 

! 

t 

i 


4 
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(d^fW  - W(0*^)  ]/  |s*^|^)  in  the  equation  above  would  be 
very  large.  Our  experience  in  this  case  is  that  the 
subgradient  optimization  performs  very  poorly  until  the 
multiplier  d^is  adjusted  to  compensate  for  the  large 

^ t 

1 1/  - K (U  } ].  Consequently,  at  the  top  of  the  tree  we 
use  for  the  target  value  two  times  the  best  relaxation 
value  found  so  far.  Similarly,  at  lower  levels  of  the 
tree,  until  an  incumbent  solution  is  obtained,  we  use 
1.2  times  the  best  relaxation  value  found  so  far  at  that 
branch. 

A second  alternative  is  fcr  the  user  to  set  the 
target  value.  In  many  real  world  implementations  it 
turns  out  that  the  user  has  a very  accurate  notion  of 
what  the  optimal  value  will  be.  In  such  situations  it 
seems  best  to  take  advantage  of  such  knowledge.  After 
trying  a number  of  alternatives,  we  have  settled  for 
using  such  a guess,  if  provided,  only  after  the  first 
level  of  the  decision  tree. 

A third  alternative  is  to  take  advantage  of  the 
relaxation  value  obtained  at  level  0 to  set  targets  at 
lower  levels.  This  value  closely  approximates  the 
linear  programming  value.  There  is  wide  experience 
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concerning  the  gap  between  the  linear  programaing 
solution  and  the  integer  solution.  (Our  experience  with 
these  four  test  probleas  is  that  they  have  abnoraally 
large  gaps  - see  Table  7.1.)  It  is  possible  to  use  the 
relaxation  value  at  level  0 of  the  tree  in  order  to 
approximate  the  value  of  the  integer  solution.  This 
approximation  could  then  be  used  as  the  target  value  for 
the  subgradient  optimization.  In  practice,  the  user  has 
the  option  of  providing  a Optimum/fielaxation  ratio  which 
the  HELSINKI  algorithm  will  use  to  determine  a target 
value  for  the  subgradient  optimization  procedure  after 
level  0.  If  the  user  guesses  this  ratio  to  be,  say, 
1.30,  the  HELSINKI  algorithm  will  multiply  the 
relaxation  value,  obtained  by  the  subgradient 
optimization  procedure  at  level  0,  by  1.30.  The 
resulting  value  will  be  used  as  a target  value  for 
subsequent  subgradient  optimizations  and  as  an  upper 
bound  on  the  problem.  Once  a solution  better  than  this 
target  value  is  found  it  is  subsequently  used  as  the 
target  and  upper  bound. 

As  the  value  of  the  Lagrangean  approaches  the 

t t I t|2 

target  value  the  perturbation  tern  d[H-U(U)]/|s| 
becomes  very  small.  It  is,  therefore,  very  difficult 
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foe  the  LagDngean  to  exceed  the  target..  Consegueiitly, 
rather  thac  using  the  incuobent/  the  best  known  solution 
to  the  problem,  as  a target  value;  we  have  made  a 
practice  of  using  1.2  times  the  incumbent  for  the 
target.  We  have  found  this  to  be  very  effective. 

j 

i 

The  second  degree  of  freedom  in  determining  is 
the  choice  of  the  multiplier  d^ . The  theoretical 
restrictions  on  d ^ are  that  at  each  iteration  d*^  be 
between  0 and  2,  that  the  sum  of  the  d^'s  must  diverge 
and  that  d^  must  go  to  0 in  the  limit  [Held,  Wolfe  & 

Crowder  (1974)  ].  Our  practice  has  been  to  start  with  d*" 
large,  do  a number  of  iterations  and  then  reduce  the 
size  of  d^.  We  continue  in  this  manner  until  either  the 
perturbation  term  goes  to  zero  or  we  have  reached  a 
preset  number  of  iterations.  In  practice,  we  do  1 
iterations  with  d*^,  then  set  L = L/2  and  d*”  = d^/2  and 
proceed.  Both  the  initial  L and  the  initial  d^  depend 
on  the  current  level  of  the  tree  (see  Table  7.3).  The 
subgradient  process  terminates  when  d^'is  less  than  the 

minimum  d^  for  that  level.  Below  level  2 the  parameters  | 

i 

are  set  as  they  are  for  level  2.  ; 


r 


1 


Level 

Initial 

Minimum 

Initial 

Minimum 

L 

L 

d® 

d^ 

0 

320 

10 

2.0 

0.0078125 

1 

80 

10 

1.0 

0.03125 

2 

40 

10 

0.25 

0.03125 

L Is  the  number  of  iterations  done  with  the  current 
value  of  the  multiplier  d^. 


Parameter  Values  for  the  Subgradient  Optimization 


\ 


Table  7.3 
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7.4 


Computation  Times  for  Four  Test  Problems 


1 

1 


ii 

I' 


Table  7. 1 indicates  run  tines  for  the  four  problems 
under  the  following  conditions.  No  side  constraints 
were  considered.  Ko  guess  was  admitted.  No  Optimum/ 
Relaxation  ratio  was  given.  In  other  words  Table  7.1 
gives  the  results  for  the  HELSINKI  algorithm  when  the 
straight  Helsinki  problem  was  considered  with  no  outside 
information.  Included  in  the  table  are  the  results  for 
the  set  partitioning  algorithm  SETPAR  under  the  same 
conditions.  CPU  seconds  are  on  a CYBER  175  computer 
using  FORTEAU  IV. 


Table  7. 4 indicates  run  time  for  both  algorithms 
incorporating  a Optimum/Reiaxation  ratio.  It  should  be 
noted  that  since  the  two  algorithms  use  different 
methods  (subgradient  optimization  and  linear  program- 
ming) to  get  relaxation  values  at  the  top  of  the 
decision  tree  the  same  ratio  will  l<.ad  to  different 
upper  bounds  for  the  two,  where  upper  bound  equals  ratio 
times  relaxation  value.  (Recall  that  the  subgradient 
procedure  stops  short  of  optimality.)  Consequently,  we 
have  chosen  ratios  for  tho  HELSINKI  algorithm  to  give 
the  same  resulting  upper  bound  that  SETPAR  calculated. 
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Problem  SETPAR  Resulting  HELSINKI  SETPAR  Problem 
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Results  wheo  Side  Coostraiots  are  Included 


The  HELSINKI  algorltha  oay  be  oodified  to  handle 
side  constraints.  A common  side  constraint  in  crev 
scheduling  problems  is  that  the  number  of  variables  in 
the  solution  be  less  than  or  egual  to  some  number  K. 

N 

Z X.  < K 
j=l  ^ 

As  mentioned  in  Chapter  5,  the  HELSINKI  algorithm 
can  take  advantage  of  this  side  constraint  in  its  prime 
number  - shortest  path  relaxation.  However#  the 
HELSINKI  algorithm  utilizes  an  additional  technique  to 
include  this  side  constraint.  At  each  node  of  the 
branch  and  bound  search  tree#  there  are  now  two  phases. 
In  phase  one  we  consider  a new  problem  formulated  by 
replacing  the  objective  function  of  the  original 
Helsinki  problem  with  our  new  constraint  to  form  the  new 
objective  function 

N 

Hin  Z X 
j=l  ^ 

lie  use  the  same  techniques  developed  in  Chapters  2 
through  4 to  solve  this  new  problem.  Ne  form  the 
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Lagrangean  relaxation  and  use  subgradient  optimization 
to  tighten  that  relaxation.  Using  the  backward-forward 
Algorithm  3.3  to  evaluate  the  relaxation,  we  are  able  to 
eliminate  many  arcs  which  cannot  participate  in  a 
solution  containing  at  most  K variables.  If  we  fail  to 
fathom  the  node  using  the  side  constraint  as  the 
objective  function,  we  then  proceed  to  phase  two.  The 
second  phase  entails  solving  the  regular  subproblem 
(with  the  original  objective  function) . The  elimination 
of  variables  and  their  partners  in  phase  one  strengthens 
our  relaxation  in  phase  two. 

Both  SETPAB  and  HELSINKI  are  capable  of  handling 
the  above  restriction  on  the  number  of  variables  in  the 
optimal  solution-  Table  7.5  compares  the  two  algorithms 
for  problems  with  this  side  constraint. 

7.6  Decision  Trees  for  the  Four  Test  Problems 

In  order  to  give  an  impression  of  the  actual 
computation  of  the  algorithm  in  solving  the  four  test 
problems,  we  include  the  decision  trees  generated  in 
problems  1 and  4.  See  Figures  7.1  and  7.2.  Some 
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special  notation  has  been  employed^  Nodes  with  an  X 
through  them,  0 , indicate  nodes  that  were  fathomed  when 
the  logical  reduction  techniques  demonstrated  that  the 
problem  was  infeasible  at  that  node..  Nodes  where  a 
feasible  solution  was  found  are  underlined  and  the  value 
of  the  feasible  solution  is  entered.  The  node  where  the 
optimal  solution  was  found  is  underlined  twice.  All 
other  hanging  nodes  were  fathomed  by  bounding.  Those 
nodes  which  were  fathomed  using  the  prime  number  - 
shortest  path  algorithm  are  depicted  by  squares  rather 
than  circles. 


7.7  Results  of  the  Prime  Number  - Shortest  Path 
Algorithm 

For  the  runs  reported  here  the  prime  number  - 
shortest  path  subroutine  was  called  whenever  the  number 
of  active  variables  became  less  than  60.  He  have  also 
made  runs  using  a cut  off  of  up  to  125  variables.  He 
have  found  75  to  be  most  effective  for  easy  problems 
(numbers  1 and  4)  and  60  to  be  most  effective  for  the 
two  hard  problems. 
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It  was  our  experience  with  a cutoff  of  60  that  the 
number  of  nodes  generated  by  the  prime  number  - shortest 
path  algorithm  was  very  small  (generally  on  the  order  of 
100) . Problems  2 and  3,  however,  generated  an  inordin- 
ate number  of  Knifedge  problems  requiring  a thousand  or 
more  nodes.  See  Figure  7.3. 

In  order  to  gain  insight  into  the  strength  of  using 
the  prime  number  algorithm  vje  ran  two  of  the  test 
problems  without  calling  the  prime  number  - shortest 
path  subroutine.  The  results  are  in  Table  7.6.  It  has 
been  our  experience  that  the  prime  number  subroutine 
leads  to  time  savings  ranging  from  5%  to  3055,  the 
average  being  about  10  to  15X. 

Finally,  we  give  results  for  a prime  number  - 
shortest  path  algorithm  developed  for  solving  general 
set  partitioning  problems.  Problems  were  obtained  by 
eliminating  rows  from  the  four  Helsinki  test  problems. 
See  Table  7.7. 
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Threshold;  When  the  number  of  variables  in  a subproblem 


Is  less  than  or  equal  to  the  threshold,  the 
prime  number  - shortest  path  subroutine  is 
called. 

- 

Run  times  are  CPU  seconds  on  a CYBER  175  using  FORTRAN  IV. 


Run  Times  for  the  HELSINKI  Algorithm  with  Different  Threshold 
Values  for  Calling  the  Prime  Number  - Shortest  Path  Subroutine 


Table  7.6 
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Derivation 

Size 

Nodes 

Time 

1 

20  X 85 

114 

0.698 

2 

20  X 39 

60 

0.569 

3 

20  X 41 

44 

0.444 

4 

20  X 46 

147 

0.486 

1 

30  X 168 

2314 

1.651 

2 

30  X 79 

767 

0.901 

3 

30  X 89 

1076 

0.873 

4 

30  X 90 

2096 

1.256 

1 

40  X 271 

>20,000 

>7.056 

2 

40  X 147 

10,745 

6.551 

3 

40  X 139 

15,100 

9.699 

4 

40  X 169 

>20,000 

>7.562 

size: 

Derivation: 

Nodes : 

Time: 


Rows  X Columns 

Number  of  test  problem  which  was  truncated  to 
form  this  problem 

Number  of  nodes  generated 

Solution  time  In  CPU  seconds  on  a CYBER  175 


Results  for  the  Prime  Number  - Shortest  Path  Set  Partitioning 

Algorithm 


Table  7.7 
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PART  III  Extensions  and  Discussion 


CHAPTER  8 

THE  CIRCULAR  ONES  PROF  M 

In  this  chapter  we  consider  the  circular  ones  or 
cyclical  staffing  problem.  This  is  the  problem  of 
scheduling  workers  in  a planning  horizon  that  has  a 
cyclical  nature.  ve  consider  the  circular  ones  set 
partitioning  problem.  We  show  that  it  may  be  solved  by 
solving  K shortest  path  problems  where  K is  less  than 
the  number  of  rows  or  columns  in  the  original  problem. 

He  interpret  the  circular  ones  problem  as  one  of  trying 
to  find  the  shortest  path  around  a circle.  In  the  final 
section  we  consider  networks  on  a circle. 
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8. 1 The  Circular  Ones  Problea 

The  circular  ones  problem  is  a special  case  of  the 
two  duty  period  scheduling  problem  that  has  been  studied 
in  some  detail  [Tucker  (1971),  Tibrewala,  Philippe  & 
Browne  (1972),  Baker  (1974),  Brownell  6 Lowerre  (1976), 
Bartholdi,  Orlin  8 Ratliff  (1977),  Bartholdi  & Ratliff 
(1977)  ].  The  model  represents  continuous  workshifts  in 
cyclical  time.  That  is,  each  person  works  a single  duty 
period  with  no  break.  By  cyclical  time  we  mean  that  the 
planning  horizon  is  of  a definite  duration  having  a 
cyclical  nature  (a  day,  week,  etc.);  however,  the  time 
chosen  to  demarcate  the  beginning  and  end  of  the 
planning  horizon  is  arbitrary.  Take  for  example,  the 
problem  of  determining  the  daily  work  schedule  for  a 
continuous  operation,  see  Figure  6.1.  The  duration  is 
determined,  twenty-four  hours.  However,  saying  that  a 
day  begins  and  ends  at  midnight  is  arbitrary.  Generally 
a single  duty  period  corresponds  to  a single  segment  in 
the  column.  However,  since  we  are  now  dealing  with 
cyclical  time,  a duty  period  may  extend  from  the  night 
of  one  day  to  the  morning  of  the  next.  See,  for 
example,  in  Figure  8.1. 
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Consequently,  the  circular  ones  problem  is  a 
special  case  of  the  two  duty  period  scheduling  problem. 
Indeed,  one  could  expect  most  of  the  columns  to  have 
only  one  segment.  In  this  case  the  network  relaxation 
would  be  strong  and  the  subgradient  optimization  could 
be  expected  to  converge  rapidly. 


( 


8.2  The  Circular  Ones  Set  Partitioning  Problem 

When  the  circular  ones  problem  is  a set  partition- 
ing problem  (P) 


(P)  Min  CX 

St  AX  = 1 

X = 0, 1 

the  HELSINKI  algorithm  would  be  expected  to  find  an 

optimal  solution  quickly.  However,  the  circular  ones 

set  partitioning  problem  can  be  solved  more  easily.  It 

requires  solving  only  K shortest  path  problems  where  K ^ 

is  the  greatest  number  of  nonzero  entries  in  a column  or  ^ 

the  smallest  number  of  nonzero  entries  in  a row, 

i 

whichever  is  less.  ^ 

168  I 


To  show  this,  consider  the  solution  to  the  circular 
ones  set  partitioning  problem.  Recall  that  each  column 


of  A represents  a CONTINUOUS  SHIFT  in  cyclical  time, 
corresponding  to  one  or  two  segments  of  ones  in  the 
column.  Choose  a variable,  Xj,  in  the  solution.  Let  I 
be  the  first  row  of  the  continuous  duty  period  of  Aj 
(i.e.  choose  I such  that  a^j  =1,  a^_|^  j =0;  or  else 
1=1).  Then  reorder  the  rows  in  the  problem  according 
to  the  transformation  Q^: 

Q j(i)  = i-  I + 1 i>I 

= l!-I  + i+  l i<I 

Qj  has  the  effect  of  rolling  the  constraint  matrix 
around,  in  the  sense  that  row  I becomes  row  1 and  the 
rows  retain  their  same  sequential  ordering,  where  row  1 
follows  the  last  row,  row  N.  This  is  equivalent  to 
starting  24  hour  days  at  some  arbitrary  time,  say  4 am, 
and  finishing  them  at  the  same  time  the  following  day. 
Under  this  transformation  the  columns  will  still  have 
the  circular  ones  property.  Consider  the  problem 
created  by  eliminating  all  columns  having  more  than  one 
segment  after  this  transformation.  This  new  problem  is 
simply  a one  duty  period  set  partitioning  problem  which 


t 
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¥e  have  showa  in  Chapter  2 to  be  equivalent  to  a 
shortest  path  problem.  Moreover,  the  optimal  solution 
to  the  original  circular  ones  set  partitioning  problem 
will  be  an  optimal  solution  to  this  new  problem. 

Let  K be  the  maximal  number  of  nonxero  entries  in  a 
column  of  the  original  circular  ones  set  partitioning 
problem.  Then,  if  we  consider  any  K consecutive  rows, 
any  feasible  solution  to  the  circular  ones  problem  must 
contain  a continuous  duty  period  which  begins  in  one  of 
those  K rows.  To  show  this,  consider  the  first  of  the  K 
rows,  I.  Bow  I is  covered  by  some  variable  Xj  of  the 
feasible  solution.  If  the  continuous  duty  period  of  X^ 
starts  in  row  I,  we  are  finished..  If  not,  let  k be  the 
last  row  of  the  continuous  duty  period  of  X^.  Then 
Q ^(k)  < K.  Otherwise,  Xj  contains  more  than  K nonzero 
entries.  Then  row  k ♦ 1 is  covered  by  a continuous  duty 
period  which  begins  in  that  row,  where  M + 1 = 1. 


Consider  now  the  following  algorithm. 


STEP  1 


Let  1=1+1 


Apply  the  transfornation  Q^to  the  circular 
ones  problem.  Eliminate  all  columns  now 
having  more  than  one  segment,  since  none 
of  these  columns  can  participate  in  a 
solution  having  a continuous  duty  period 
starting  in  row  i.  Solve  the  transformed 
problem  as  a shortest  path  problem.  The 
solution  will  be  an  optimal  solution  under 
the  restriction  that  the  solution  contain 
a continuous  duty  period  starting  in  row 
I. 

If  I < K go  to  STEP  1. 

STEP  2 Of  the  K solutions  generated,  choose  the  best. 

STOP 

Algorithm  8.1  will  solve  the  original  circular  ones 
problem.  We  have  shown  this  since  the  solution  will 
have  a continuous  duty  period  starting  in  one  of  the 
first  K rows,  say  row  I.  Then  the  Ith  iteration  of 
Algorithm  8. 1 will  find  that  solution.  Of  course  the 
solutions  found  at  each  iteration  of  the  algorithm  will 
be  feasible  solutions  to  the  original  circular  ones 
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problen,  since  the  shortest  path  problca  generated  is  a 
restriction  of  the  original  problem. 

Likewise  we  can  show  that  (P)  (for  an  example  see 
Figure  6.2)  can  be  solved  by  solving  K shortest  path 
problems,  where  K is  the  smallest  number  of  nonzero 
entries  in  a row.  Repeat  the  first  constraint  as  an  H 
1st  constraint  to  give  an  enlarged  constraint  matrix  A'. 
See  Figure  8.3.  Apply  transformation  I of  Chapter  2 to 
the  constraint  matrix  A'.  Then  TA'  can  be  partitioned 
into  {a^lCTA')*]  where  a^  is  the  first  row  of  A (recall 
that  T leaves  the  first  row  of  A unchanged),  and  (TA')* 
is  a matrix  representing  a shortest  path  problem 
[Bartholdi,  Orlin  & Ratliff  (1977)  ].  See  Figure  8.4. 

In  other  words,  (P)  may  be  rewritten  as  a shortest  path 
problem  with  one  complicating  constraint  of  the  form 

N 

’ ‘IJ  “ 

This  complicating  constraint  simply  requires  that 
only  one  of  the  variables  X ^ for  which  a^^  » 1 can  be 
nonzero.  The  following  algorithm  wil?  solve  (P) . 
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Figure  8 . 2 
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The  Circular  Ones  Example  of  Figure  8.2  with  the  First  Constraint 
Repeated  as  a Thirteenth  Constraint 


Figure  8 . 3 
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The  Circular  Ones  Example  of  Figure  8.3  Transformed  by  Tj^ 


Figure  8 . 4 
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ilgorithm  8.2 


STEP  0 Find  the  row  I with  the  least  nucber  of 
nonzero  entries. 

Apply  transformation  to  (P) . 

Let  j = 0. 

STEP  1 Let  j = j + 1 

If  aj^j  =0  GO  TO  STEP  1 
Solve  the  shortest  path  problem  defined 
by  (TA')*  with  Xj  = 1, 

If  j < N GO  TO  STEP  1 

STEP  2 Of  the  solutions  generated,  choose  the  best. 
STOP 

It  is  an  interesting  feature  of  Algorithm  8.2  that 
the  shortest  path  problems  solved  will  all  be  acyclic. 
This  is  because,  in  the  transformed  constraint  matrix, 
precisely  those  columns  representing  arcs  going  backward 
are  the  columns  which  are  set  equal  to  zero  or  one 
before  the  shortest  path  problem  is  solved. 
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8.3 


Shortest  Path  on  a Circle 


There  is  an  interesting  interpretation  to  Algorithm 
8.1.  Suppose  the  original  circular  ones  set 
partitioning  problem  has  H rows.  Consider  a network 
with  H nodes.  Think  of  the  N nodes  as  being  arranged  in 
a circle  (see  Figure  8.5  corresponding  to  Figure  8.2). 
Interpret  each  variable  of  the  problem  as  an  arc 
beginning  at  the  node  corresponding  to  the  row  in  which 
the  continuous  duty  period  of  the  variable  begins,  and 
ending  at  the  node  after  the  one  corresponding  to  the 
row  in  which  the  continuous  duty  period  of  the  variable 
ends.  See  Figure  8.5.  Enter  the  arcs  on  the  network 
diagram  by  always  directing  them  clockwise  around  the 
circle  of  nodes.  The  circular  ones  set  partitioning 
problem  corresponds  to  finding  the  shortest  path  on  a 
circle.  One  need  only  consider  any  node,  say  I,  and 
solve  the  shortest  path  problem  where  node  1 is  both  the 
origin  and  the  destination  for  the  problem.  Then 
resolve  the  problem  using  node  I 4’  i as  the  origin/ 
destination.  Continue  in  this  manner  until  the  shortest 
path  problem  has  been  solved  for  K consecutive  origin/ 
destinations  where  K is  the  greatest  number  of  nonzero 
entries  in  a column.  This  guarantees  finding  the 
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shortest  path  around  the  circle.  This  is  because  the 
solution  must  contain  an  arc  which  begins  at  one  of  the 

I 

K nodes  considered,  since  no  arc  is  longer  than  K ♦ 1 
, nodes. 

Algorithm  8.2  may  be  interpreted  in  a similar 
manner.  If  (P)  is  considered  as  a shortest  path  problem 
on  a circle,  find  the  node  I with  the  least  number  of 
it  arcs,  say  K,  starting  at  that  node  or  passing  over  that 

node.  Then  solve  the  shortest  path  problem  starting  and 
ending  at  node  I,  K times.  Each  time  set  one  (a  dif- 
ferent one)  of  the  K arcs  passing  over  or  starting  at  | 

i 

node  I equal  to  one,  and  the  other  K - 1 equal  to  zero.  | 

i 

f 
1 

8.4  Networks  on  a Circle  1 


These  intepretations  extend  to  the  general  circular 
ones  problem  (P) 


(P) 


Min  CX 

St  AX  = b 

X Integer,  Nonnegative 
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Rov  the  problea  is  one  of  a oetvork  on  a circle. 
Consider  the  example  in  Figure  8.1.  Let  I be  the  row  of 


the  A matrix  with  minimum  b^^^  Then  apply  transformation 
to  A.  See  the  example  in  Figure  8.6.  Nov  repeat  the 
first  constraint  as  an  n 1st  constraint  to  the  problem 
and  apply  transformation  T of  Chapter  2 to  the  problem 

I (see  Figures  8.7  and  8.8).  The  problem  has  been 

transformed  to  a network  flow  problem  with  one  side 

constraint  a X = b , where  a,  is  the  1th  row  of  A. 

1,  III 

t' 

I , 

j (P*)  Min  CX 

st  a^  X = bj 
A*X  = b* 

X Integer.  Nonnegative 

I 

Interpreting  (P*)  in  our  circular  network  model,  we 
see  we  have  a normal  network  problem  with  the  additional 
constraint  that  the  amount  of  flow  passing  through  Ofi 
PAST  node  1 must  equal  b^. 


Formulation  (P*)  suggests  how  easy  it  might  be  to 
solve  the  circular  ones  problem  using  Algorithm  3.1. 
There  is  only  one  complicating  constraint  in  (P')« 
Therefore,  we  form  the  Langrangean  relaxation 
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’/ 


Min  2 C X 

j=l  ^ ^ 


8t  X,  X-  X,  X,  X,  X,  X,  RHS 

1 / J 4 J O / 

4 am  -6  am  0 1 1 0 0 0 0 »•  1 

6am-8am  0 1 1 0 0 0 0 = 1 

8am-10am  0 1 0 0 1 0 0 « 2 

10  am  - 12  m 0 1 0 0 1 0 0 = 1 

12  m-2  pm  0000101  = 2 

2 pm  -4  pm  0 0 0 0 1 0 1 = 1 

4pm-6pm  1 0 0 0 0 0 1 = 1 

6pm-8pm  1 0 0 0 0 0 1 = 1 

8 pm  -10  pm  1 0 1 0 0 0 0 = 1 

10  pm  - 12  pm  1 0 1 1 0 0 0 - 2 

12pm-2am00  1 1010  = 3 
2am-4am  0 1 1 1 0 0 0 ■ 2 


Xj  > 0,  Integer  j - 1,2 7 


The  Circular  Ones  Example  of  Figure  8.1  Transformed  by 


Figure  8 . 6 
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The  Circular  Ones  Example  of  Figure  8.6  with  the  First  Constraint 
Repeated  as  a Thirteenth  Constraint 


Figure  8 . 7 
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> 0,  Integer  j = 1,2 7 


The  Circular  Ones  Example  of  Figure  8.7  Transformed  by  T^^ 


Figure  8.8 


r 


11(0)  = -Ub^  ♦ Min  (C  + Oa^)X 


St 


X in  S 


where  S is  the  set  of  solutions  to  the  network  flow 
problem  defined  by  A*X  = b*.  H{U)  is  a concave 
piecewise  linear  function  and  0 is  a scalar.  Con- 
sequently, the  subgradient  optimization  reduces  to  a 
simple  line  search  for  a concave  function. 
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CHAPTER  9 


THE  K DUTY  PERIOD  SCHEDULING  PROBLEM 


In  this  chapter  we  consider  the  K duty  period 
scheduling  problem.  After  describing  the  problem  in 
Section  9.1,  we  show  in  the  next  section  that  the 
Lagrangean  relaxation  is  once  again  a network  flow 
problem.  In  Section  9.3  we  demonstrate  that  subgradient 
optimization  can  be  used  to  tighten  the  relaxation. 
Finally,  we  interpret  the  iterative  process  of  the 
subgradient  optimization. 


9.1  The  K Duty  Period  Scheduling  Problem 

Although  thus  far  we  have  dealt  exclusively  with 
the  two  duty  period  scheduling  problem,  it  is  quite 
possible  to  extend  our  development  to  the  more  general  K 
duty  period  scheduling  problem  where  K = 1,2,3,....  The 
formulation  would  be 
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applications.  The  first  is  simply  the  two  duty  period 
scheduling  problem  on  a continuous  twenty-four  hour 
operation.  This  is  analogous  to  the  circular  ones 
problem  discussed  in  the  previous  chapter.  For 
example,  consider  the  problem  of  assigning  drivers  to 
buses  which  run  24  hours  a day.  It  is  not  possible  to 
formulate  this  problem  as  a two  duty  period  scheduling 
problem  so  long  as,  for  each  row  of  the  constraint 
matrix,  there  exists  at  least  one  duty  period  covering 
that  row  but  not  starting  in  it. 


Suppose  we  are  concerned  with  Bus  A.  Any  24  hour 
(or  other)  period  need  not  have  exactly  two  segments  in 
each  column  in  the  constraint  matrix.  See  Figure  9.1. 


I 

j 
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Xj  ^ 0,  Integer  j - 1,2,..., 6 


An  Example  of  the  Circular  Ones  Problem  with  Two  Continuous 
Duty  Periods  for  Each  Worker 


Figure  9.1 
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Sheceas  possible  driver  schedules  correspond  to  one 
or  two  segnents  (e.g.  X^)  others  might  inevitably 

lead  to  three  segments  (e.g.  X^).  Due  to  the  wrap 

around  nature  o£  the  continuous  schedule  a duty  period 
might  begin  near  the  end  of  the  column  <e.g.  10  pm)  and 

terminate  near  the  beginning  of  the  column  (e.g.  2 am). 

This  situation  leads  to  three  segments  in  the  column. 
Consequently,  the  three  duty  period  scheduling  problem 
corresponds  to  scheduling  personnel,  each  having  at  most 
two  duty  periods  in  a given  time  period  (a  day) , where 
the  job  must  be  performed  continuously  from  day  to  day. 

• 

It  is  not  difficult  to  consider  examples  of  the  K 
duty  period  scheduling  problem  where  K is  greater  than 
3.  Such  a problem  corresponds  to  scheduling  personnel 
to  serve  at  most  K duty  periods  in  a given  planning 
horizon,  for  example  five  duty  periods  in  seven  days. 
Such  a problem  would  be  a K duty  period  scheduling 
problem  in  the  case  of  linear  time  or  a K ♦ 1 duty 
period  scheduling  problem  in  the  case  of  cyclic  time. 

An  interesting  extension  of  this  idea  is  that  the 
general  H X N set  partitioning  problem  can  be  viewed  as 
an  C(ll4>1)/2]  duty  period  scheduling  problem,  where 
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• 


t (H  + 1)/2]  is  the  largest  integer  less  than  or  equal  to 
(M+1)/2.  This  is  simply  because  [ (M+1)/2]  is  the 
greatest  number  of  segments  possible  in  a column  with  H 
rows. 

9.2  The  Felaxation 

Consider  the  K duty  period  scheduling  problem  (P) 

(P)  Hin  CX 

St  AX  b 

X Integer,  Nonnegative 

where  each  column  contains  at  most  K segments.  Our 
relaxation  will  be  to  decouple  the  K segments  in  each 
column  to  give  K new  columns  each  with  one  segment.  The 
relaxation  is  then  a network  flow  problem.  To  create  a 
problem  equivalent  to  (P)  one  must  add  side  constraints 
forcing  the  K variables  replacing  each  original  variable 
to  be  equal.  We  have  already  proved  in  Proposition  2. 1 
that  (P*)  is  equivalent  to  (P) . 
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N 


(P*)  Hin  Z E ci^ 

j-1  k=l  J J 

„ k. 


N JM 

S 51  Aj3^  = 

J=1  k=l  J ^ 


T ^ — Y s 0 

J j 

k . _ 


j = 1,.,..,N;  k = 


Xj  >0,  Integer  j = k = 


vhere 


"J 

^ = C . 

k-1  J J 


/ Np  kj  M 


A column  containing  K segments  in  the  original 
problem  (P)  will  correspond  to  K arcs  in  the  relaxation. 
If  all  of  these  partner  arcs  (corresponding  to  the  same 
original  column)  have  the  same  value  in  the  relaxation, 
then  there  is,  in  essence,  no  relaxation.  The  more  the 
partner  arcs'  values  differ,  the  looser  the  relaxation 
will  be.  In  tnis  sense,  we  can  say  that  the  larger  K 
is,  the  weaker  the  network  flow  relaxation  will  be^ 

This  is  simply  because  the  number  of  arcs  has  been 
increased. 


9.3  Tightening  the  Relaxation 


Once  again  it  would  be  possible  to  use  subgradient 
optimization  to  tighten  the  relaxation.  Consider  the  K 
doty  period  scheduling  problem  (P*)  which  nay  be 
rewritten  as  (P") 

N '"j 


(P")  Hin  EE  cSj 
» . J-l  k-l  ■'  ’ 


X In  S 

st 


Xj  - = 0 j = 

k = 


where  S is  the  set  of  solutions  to  the  network  flow 
problem  given  by  the  constraints 


N 

EE  ‘J 

j-l  k-l  J 


= b 


Then  the  Langrangean  relaxation  of  (P")  is 


N 


k-l 
N j 


i(0) 


Hin  Y,  E ♦ Z)  E 

X in  S j=l  k-l  •’  J-l  k-l  ^ ^ 


Hin 
X In  S 


(Cj  ♦ U J - ) X J 


where  Uj  = 0 , ® j ^ ^ ® 
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and  0 = (o},.-.,oJl,U2.— ,oJjN) 

Since  H (0)  has  the  integrality  property,  nax  V (0)  will 
be  equal  to  the  linear  prograaning  relaxation  of  (P) . 


I 


! 

i- 


Consider  the  function  H(0).  This  is  a piecewise 
linear  concave  function,  since  it  is  the  pointwise 

A 

minimum  of  a family  of  linear  functions  of  U.  Suppose  X 

= (xJ,...,Xjl,X2,...,X^)  is  optimal  for  H (0) . Then  as 

'"k  '^kfl 

we  have  shown  in  Chapter  4,  Section  4.1,  {X  - X ) is 

A 

a subgradient  of  W at  U,  where 


^ki-1  -ki 


''ki 


. N, 


Consequently,  we  may  again  use  suLgradient 

A 

optimization  to  tighten  our  relaxation.  Let  N be  the 
target  value.  Then  at  iteration  t ■»  i,  we  define 


Qt+l  = ♦ d‘^[W  - H(u‘^)]sV 


t ^ k * k+1  t 

where  s = (X  ~ X ) 


^Ic  ^kr^l 

= (X  - X ) at  iteration  t 
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where  satisfies  0 < d £ 2«  d ->  0, 

00 

and  S = ® - 

t-1 


9.4  Interpreting  the  Subgradient  Optimization  Iteration 

To  interpret  the  iterative  step  defining  the 

■altipliers  at  iteration  t ♦ 1,  let  us  develop  a little 

terninology.  Each  variable  X in  our  original  problem 

1 2 3 k-i 

was  decoupled  into  variables  (X  .,Xj ,Xj ,XjJ ) . For 
each  variable  Xj,  define  its  FABTNEfiS  to  be  its 
PREDECESSOB  ^ SUCCESSOR  X , where  they 

exist  (i.e.  k ^ 1,kj).  Define  the  ASSOCIATES  of  3^  to 
' be  Xj  , for  all  i # k,  1 4 i 4 k^.  How,  at  iteration 
t ♦ 1; 


i 


r 


Hole  for  the  case 


2,  Case  3 above  becomes 


Case  3*;  It  = = 2 

= (Cj)  ^ U - II(U‘')  ]£  (Xj)  - (xj)*']/  |s*^|^ 

and  Cases  2 and  3*  define  the  iterative  formula 

developed  in  Chapter  4 for  C^^^  for  the  two  duty  period 

1 2 

scheduling  problem,  where  Xj  = Yj  and  Xj  = Zj . 

Let  us  examine  the  general  formula  given  in  Case  1 
above.  Recall 

s = (Sj)  j = X = 

Then 


t 2 

So  |s  I is  equal  to  the  number  of  times  a decoupled 
arc  is  different  than  its  successor.  Note  that  in  Case 

1 above 
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-2  < - ♦ 2(xjf  - < 


Let  P be  the  penalty  at  iteration  t ♦ 1 


Then 


[i  - M(U^  ]/  |s*^|2 


k . t + 1 


(Cj  ) 


(Cj  } ^ ♦ 2P^  if  Xj  is  in  the  current 

solution  but  neitner  its  suc- 
cessor nor  its  predecessor  is 


(Cj  = 


k t t k. 

(Cj  ) ♦ P if  Xj  is  in  the  current  solu- 

tion and  either  its  successor 
or  predecessor  (but  not  both) 
is  also  in 


-k  . t+1 


k . t 


(Cj  ) = (Cj  ) 


if  Xj  and  its  successor  and 
predecessor  are  either  all  in 
or  all  not  in  the  current 


solution 


The  K duty  period  scheduling  problem  may  be  handled 
in  a way  analogous  to  the  two  duty  period  scheduling 
problem.  By  decoupling  the  segments  in  each  column  the 
problem  can  be  reformulated  as  a network  flow  problem 
with  side  constraints,  in  the  case  of  a set  parti- 
tioning problem  the  relaxation  becomes  an  acyclic 
shortest  path  problem.  When  an  arc  is  chosen  to  be  in 
the  solution  to  the  shortest  path  relaxation#  all  of  its 
associates  are  also  chosen..  It  is  then  possible  to 
eliminate  all  arcs  (and  their  associates)  which  conflict 
with  these  chosen  arcs.  Similarly#  using  the 
backward-forward  iterative  solution  procedure  for  the 
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shortest  path  problem,  it  is  possible  to  eliminate  arcs 
which  cannot  participate  in  a solution  better  than  the 
best  known  solution.  Of  course,  when  an  arc  is 
eliminated,  its  associates  are  also  eliminated. 
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CHAPTER  10 

GENERAL  METHODOLOGY 

In  this  chapter  we  consider  the  general  procedure 
of  decoupling  columns  to  form  problems  amenable  to 
Lagrangean  relaxation  techniques.  It  is  shown  that  the 
methodology  is  applicable  to  near  block  diagonal 
matrices. 

10.1  Solution  Procedure  for  the  Two  Duty  Period 
Scheduling  Problem 

The  previous  chapters  have  been  devoted  to 
developing  a solution  procedure  for  the  two  duty  period 
scheduling  problem.  In  this  chapter  we  try  to  examine 
the  methodology  that  has  been  developed  and  see  to  what 
extent  it  may  be  generalized. 

He  began  with  the  two  duty  period  scheduling 
problem  (P)  : 
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(P) 

Min 

CX 

* i 

i 

St 

AX  = b 

} 

X Integer,  Nonnegative 

A 

where  each  column  of 

A contained  exactly  two  segments  of 

ones.  As  such. 

the 

problem  could  be  handled  only  by 

k 

general  integer  programming  teciinigues.  However,  it  was 
recognized  that  by  decoupling  the  columns,  the  problem 
could  bs  transformed  into  a network  flow  problem  with 
side  constraints.  These  side  constraints  were  very 
simple,  of  the  form  Y - Z = 0. 

' It  was  shown  that  by  putting  these  side  constraints 

into  the  objective  function,  a very  easily  evaluated 
Lagrangean  could  be  formed.  Moreover,  due  to  the  simple 
nature  of  the  side  constraints,  the  dual  problem  could 
be  solved  very  efficiently  by  subgradient  optimization 

techniques.  The  computation  involved  at  each  iteration 

i ; 

\,i  was  very  slight  compared  to  solving  the  linear  pro- 

gramming relaxation  of  the  original  two  doty  period 
scheduling  problem. 
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10.2  sroader  Applications  of  the  Methodology 
Developed  to  Solve  the  Two  Duty  Period 
Scheduling  Problem 

The  technique  used  in  solving  the  two  duty  period 
scheduling  problem  was  simply  to  decouple  the  columns  to 
achieve  an  easily  solved  problem  with  side  constraints. 
Are  there  other  situations  when  ti.e  same  thing  might  be 
done  profitably? 

There  is  the  immediate  extension  to  all  problems 
having  at  most  two  plus  ones  and  two  negative  ones  in 
each  column,  the  other  entries  being  zero.  Each  column 
could  then  be  decoupled  into  two  columns  each  containing 
one  plus  one  and  one  negative  one.  The  problem  could 
then  be  solved  as  a network  flow  problem  with  coupled 
arcs.  Similarly,  any  0-1  matrix  having  at  most  four 
nonzero  entries  in  a column  could  be  transformed  to  a 
matching  problem  with  coupled  arcs  [Nemhauser  (1978)]. 

He  look  now  to  other  applications  of  the  decoupling 
technique.  Consider  a block  diagonal  coefficient  matrix 
with  K blocks  and  with  complicating  variables  (see 
Figure  10.1).  Benders  Decomposition  is  a well  known 
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technique  for  handling  this  particular  problem.  An 

alternative  method  would  be  to  decouple  the  complicating 

variables.  Replace  each  complicating  variable  X with  K 
k k 

variables  X , k = where  X contains  the  entries 

of  the  rows  in  the  kth  block  in  the  block  diagonal 

structure  of  the  easy  variables.  Add  the  complicating 
k k+1 

constraints  X = X r k = 1,...,K-1.  Then  rather  than 
use  Benders  on  the  problem  it  may  be  advantageous  to 
dualize  with  respect  to  the  new  complicating  constraints 
and  solve  the  Lagrangean  relaxation  with  subgradient 
optimization. 

A special  case  of  the  near  block  diagonal  problem 
is  the  mathematical  programming  problem  with  staircase 
structure,  see  Figure  10.2.  In  this  problem  we  again 
have  K blocks.  However,  now  each  complicating  variable 
has  entries  in  only  two  (adjacent)  blocks.  Consequently, 
replace  each  complicating  variable  X with  Y and  Z where 
Y contains  the  entries  of  the  rows  in  the  first  block 
covered  by  X,  and  Z contains  the  entries  of  the  rows  in 
the  second  block.  Add  the  complicating  constraints  Y - 
Z.  The  problem  now  can  be  solved  using  Lagrangean 
relaxation  and  subgradient  optimization. 
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A sinilar  situation  arises  for  near  block  diagonal 
problems  vith  complicating  constraints,  see  Figure  10.3. 
Here  we  consider  the  simplest  case  of  only  two  blocks. 
Consider  a complicating  constraint 


k 


I 

J*1 


N 

I 

j=k+l 


b 


i 


This  constraint  nay  be  replaced  by  the  two  constraints 


I a i .X  ♦ Yi  = bi 


to  fora  an  equivalent  problem.  If  each  of  the  compli- 
cating constraints  is  replaced  by  two  constraints  in 
this  way,  the  problem  has  been  transformed  to  a near 
block  diagonal  problem  with  complicating  variables  (the 
I^).  Then,  as  we  have  shown,  the  complicating  variables 
may  be  decoupled  to  solve  the  problem  via  Lagrangean 
relaxation  and  subgradient  optimization.  This  technique 
may  be  easily  extended  to  a near  block  diagonal  problem 
with  K blocks  and  complicating  constraints. 
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A Near  Block  Diagonal  Problem  with  Complicating  Constraints 
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